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GENES FOR ASSESSING CARDIOVASCULAR STATUS 
AND COMPOSITIONS FOR USE THEREOF 

FIELD OF THE INVENTION 

The present invention relates to genetic polymorphisms and polymorphism 
patterns useful for assessing cardiovascular status in humans. More particularly, the 
invention relates to identifying and using polymoiphism patterns comprising a 
polymorphism in the a gene encoding a polypeptide selected from the group consisting of 
ACE, ATI, ACT, renin, aldosterone synthase, type-2 angiotensin II receptor, endothelin 
receptor, and p-adrenoceptor to predict a treatment outcome or likelihood of developing 
cardiovascular disease, and to assist in diagnosis and in prescription of an effective 
therapeutic regimen. 

BACKGROUND OF THE INVENTION 
The renin-angiotensin-aldosterone system (RAAS) plays an important role 
in cardiovascular physiology in mammals. Specifically, RAAS regulates salt-water 
homeostasis and the maintenance of vascular tone. Stimulation or inhibition of this 
system raises or lowers blood pressure, respectively, and disturbances in this system may 
be involved in the etiology of, for example, hypertension, stroke, and myocardial 
infarction. The RAAS system may also have other functions such as, e.g., control of cell 
growth. The renin-angiotensin system includes renin, angiotensin converting enzyme 
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(ACE), angiotensinogen (ACT), type 1 angiotensin II receptor (ATI), type 2 angiotensin II 
receptor (AT2) and aldosterone synthase. 

International Patent Application No. PCT/IB98/00475, filed April 1 , 1 998, 
discloses for the first time an association of polymorphism patterns in ACE, ACT, and 
ATI genes with cardiovascular status, particularly with the ability to predict the 
therapeutic outcome of a particular treatment regimen. 

Other genes express polypeptides involved in pathways other than RAAS 
that also play a role in the regulation of cardiovascular physiology, although it is believed 
that prior to the present invention, no known association of polymorphism pattems in 
these genes with cardiovascular status has been observed or reported. Among these other 
regulators of cardiovascular physiology are endothelin, endothelin receptor, and 
P-adrenergic receptors 1 and 2. 

RAAS Pathway Components 

AGT is the specific substrate of renin, an aspartyl protease. The human 
AGT gene contains five exons and four introns which span 13Kb (Gaillard et aly DNA, 
1989, 8:87-99; Fukamizu et aL J. Biol, Chem., 1990, 265:7576-7582). The first exon (37 
bp) codes for the 5' untranslated region of the mRN A. The second exon codes for the 
signal peptide and the first 252 amino acids of the mature protein. Exons 3 and 4 are 
shorter and code for 90 and 48 amino acids, respectively. Exon 5 contains a short coding 
sequence (62 amino acids) and the 3 -untranslated region. 

Plasma AGT is synthesized primarily in the liver and its expression is 
positively regulated by estrogens, glucocorticoids, thyroid hormones, and angiotensin II 
(Ang II) (Clauser et aL, Am. J. Hypertension, 1989, 2:403-410). Cleavage of the amino- 
terminal segment of AGT by renin releases a decapeptide prohormone, angiotensin-!, 
which is further processed to the active octapeptide angiotensin II by the dipeptidyl 
carboxypeptidase designated angiotensin-converting enzyme (ACE). Cleavage of AGT by 
renin is the rate-limiting step in the activation of the renin-angiotensin system. 

Several epidemiological observations indicate a possible role of AGT in 
blood pressure regulation. A highly significant correlation between plasma AGT 
concentration and blood pressure has been observed in epidemiological studies (Walker et 
aL, J. Hypertension, 1979, 1 :287-291), Interestingly, a number of allelic dimorphisms 
have been identified in the AGT gene. The frequency of at least two of them (174M and 
235T) have been partially characterized and in certain populations shown to be 
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significantly elevated in hypertensive subjects (Jeunemaitre et al. Cell, 1992, 71:169- 
180), In addition, a specific polymorphism, 23 5T, has been suggested to be directly 
involved in coronary atherosclerosis (Ishigami et al. Circulation, 1995, 91:951-4). 
Futhermore, the presence of A or G at position 1218 in the AGT regulatory region has 
been correlated with differences in in vitro transcriptional capacity for this gene (Inoue et. 
al., J. Clin. Invest., 1997, 99:1786). However, the foregoing are studies involving only 
one or at most two polymorphisms. Furthermore, the sole disclosed use is susceptibility to 
disease. 

The human ACE gene is also a candidate as a marker for hypertension and 
myocardial infarction. ACE inhibitors constitute an important and effective therapeutic 
approach in the control of human hypertension (Sassaho et al, Am. J. Med., 1987, 83:227- 
235). In plasma and on the surface of endothelial cells, ACE converts the inactive 
angiotensin I molecule (Ang I) into active angiotensin II (Ang II) (Bottari et al.. Front. 
Neuroendocrinology, 1993, 14:123-171). Another ACE substrate is bradykinin, a potent 
vasodilator and inhibitor of smooth muscle cell proliferation, which is inactivated by ACE 
(Ehlers et al. Biochemistry, 1989, 28:531 1-5318; Erdos, E.G., Hypertension, 1990, 
16:363-370; Johnston, C.I. Drugs (suppl. 1), 1990, 39:21-31). 

Levels of ACE are very stable within individuals, but differ greatly between 
individuals. A greater risk of myocardial infarction has been identified in a group of 
subjects with an ACE polymorphism designated ACE-DD (Cambien et al. Nature, 1992, 
359:641-644), and a 12-fold greater risk of myocardial infarction has been identified in a 
subgroup of patients having a combination of the ACE polymorphism ACE-DD and one 
of the AGT polymorphisms (235T) described above (Kamitani et al. Hypertension, 1994, 
24:381). Recently, six ACE polymorphisms were identified and characterized (Villard et 
al. Am. J. Human Genet., 1996, 58:1268-1278). 

The vasoconstrictive, cell growth-promoting and sah conserving actions of 
angiotensin II are mediated through binding to and activation of angiotensin receptors, of 
which at least two types have been cloned (ATI and AT2). The type 1 Ang II receptor 
(ATI), a G-protein-coupled seven transmembrane domain protein, is widely distributed in 
the body and mediates almost all known Ang II effects (Fyhrquist et al., J. Hum. 
Hypertension, 1995,5:519-524). 

Several polymorphisms have been identified in the ATI receptor gene. 
Initial studies suggest that at least one of them is more frequent in hypertensive subjects 
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(AT^*^C)(Bonnardeaux et a/.. Hypertension, 1994, 24:63-69). This polymorphism, 
combined with the ACE-DD polymorphism, has been shown to correlate strongly with the 
risk of myocardial infarction (Tiret etal. Lancet, 1994, 344:910-913). 

Endotheiin fET) Regulation of Cardiovascular Physiolog y 

Endothelin is a potent vasoconstrictive peptide characterized by long 
lasting action. It was first discovered as a vasoconstricting factor in conditioned medium 
(Hickey et al.. Am. J. Physiol., 1985, 248:C550), and subsequently purified and 
characterized (Yanagisawa ei al. Nature, 1988, 332:41 1). ET is produced as 
preproendothelin, which is cleaved after removal of the signal sequence by an 
endopeptidase, followed by cleavage with endothelin converting enzyme (Xu et a/., Cell, 
1994, 78:473; Shimada et ai, J. Biol. Chem., 1994, 269:18274). Analysis of the human 
£rgene has revealed the existence of two additional ET-like peptides expressed in various 
tissues, termed ET-2 and ET-3 (Inoue et al, Proc. Natl. Acad. Sci. USA, 1989, 86:2863). 
The first endothelin was accordingly termed ET-1. 

One of the important discoveries following characterization of ETs was the 
discovery of two ET receptors, ET^ and ETg (Arai et al. Nature, 1990, 348:730; Sakumi 
et al. Nature, 1990, 348:782). Both belong to the family of heptahelical G-protein 
coupled receptors. There is 68% amino acid identity between the two receptor subtypes. 
ET^ exists as a single copy gene located on human chromosome 4 (Hosoda et aL, J. Biol. 
Chem., 1992, 267:18797; Cyre/a/.,Biochem. Biophys. Res. Commun., 1991, 181:184). 
ETg exists as a single copy gene located on human chromosome 13 (Arai et ai, J. Biol. 
Chem., 1993, 268:3463-70), although a splice variant of ETg has been found (Shyamala 
et al. Cell. Mol. Biol. Res., 1994, 40:285-96). The cDNA sequence of ET^ has been 
deposited with GenBank with accession number S57498, 

In the early stage of ET research, a great number of pharmacological 
studies suggested that the responses to ETs could be divided into two groups according to 
the pharmacological potency of the three peptides. Indeed, these two receptors, ET^ and 
ETg, are distinct in their ligand binding affinity and distribution in tissues and cells. ET^ 
has a high affinity to ET-1 and ET-2, but a low affinity to ET-3. ETq has equally potent 
affinities to all three endogenous ETs. ET^ exists on smooth muscle and mediates 
vasoconstriction. In contrast, ET^ exists on endothelium and mediates the release of 
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relaxing factors such as nitric oxide and prostacycline. However, several reports 
demonstrated that ETq on some vascular smooth muscle also mediated vasoconstriction. 

Cloning of the ET receptor gene facilitated the development of ET-receptor 
antagonists, such as BE-18257B, and BQ-123 and FR139317, two derivatives of 
BE-18257B (See Masaki, Cardiovascular Res., 1998,39:530). Many selective and 
non-selective antagonists for ET^ and ETg have emerged. 

Although the pathophysiological role of ET is still unclear, ET antagonists 
demonstrated significant beneficial effects in pathological conditions, including congestive 
heart failure, pulmonary hypertension, cerebrovascular spasm after subarachnoid 
hemorrhage, acute renal failure, and essential hypertension. ET or ET receptor knockout 
mice have also provided important information regarding the physiological and 
pathophysiological significance of ET (Masaki, supra). In particular, mice with a 
knockout of ET-3 or the ETq receptor genes exhibit phenotypic changes that resemble 
Hirschsprung's disease, a human hereditary syndrome associated with a missense mutation 
of the ETs gene (Pfiffenberger, et aL, Cell, 1994, 79:1257), 

Despite advances in understanding the role of the endothelin pathway in the 
treatment of cardiovascular diseases, questions remain. Not all experimental models of 
hypertension respond to endothelin antagonists, and it remains a uncertain whether 
endothelin antagonists improve cardiac structure and function beyond the benefits of blood 
pressure reduction (Moreau, Cardiovascular Res., 1998, 39:534). Thus, there is a need in 
the art for a reliable and effective means for predicting whether endothelin antagonists will 
be effective for treating hypertension in a given individual. 

P'Adrenergic Receptors (P-Adrenoceptors) 

The adrenoceptors fall into three major groups, a,, ttj, P, within each of 
which further subtypes can be distinguished pharmacologically (LuUmann, et al. in Color 
Atlas of Pharmacology, New York, 1993), Adrenergic receptors are all G-protein linked. 
They are involved in regulation of the cardiovascular system, and in the control of 
metabolic activity, e.g., insulin secretion and glucose release. They also mediate 
constriction or relaxation of smooth muscle cells in the respiratory, gastrointestinal, and 
genitourinary tracts (Berne and Levy, Principles of Physiology (2"*^ Ed.), Mosby-Year 
Books, Inc., 1996, pp. 691-696). 
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Adrenoceptors are targets for epinephrine and norepinephrine, which are 
representatives of the family of monoamine neurotransmitters. Epinephrine has equally 
high affinity for all a- and p-receptors while norepinephrine differs from epinephrine by 
its low affinity for Pj-receptors (The Biochemical Basis of Neuropharmacology, (7*'' Ed.) 
New York, 1996, pp. 226-292), The adrenoceptors themselves interact preferentially with 
three different classes of G-proteins: (P-adrenoceptors) mediating activation of 
adenylate cyclase, Gj (aj-adrenoceptors) mediating inhibition of adenylate cyclase, and G^ 
(a, -adrenoceptors) mediating activation of phospholipase C (Hieble et ai, J. of Med, 
Chem., 1995,38:3415-3444). 

The pharmacological interest of adrenoceptors is mainly for the treatment 
of cardiovascular diseases, e.g., through the development of P-antagonists, a|-antagonisls 
and ttj-agonists to treat hypertension, but they are also considered important for the 
treatment of asthma (P2-agonists). 

The Pj-adrenergic receptor is expressed on a number of cell types, e.g., 
bronchial smooth muscle, where its activation results in relaxation and bronchial 
dilatation. These receptors are also being expressed on epithelial cells, vascular 
endothelium, alveolar walls, immune cells, and presynaptic nerve terminals (Liggett, Am. 
J. Respir. Grit. Care. Med., 1997, 156:S156-S162). Cardiac cells express mainly p,-, but 
also a small fraction of Pj-adrenoceptors (Collins et al, Biochimica et Biophysica Acta, 
1993, 1 172: 171-174). p,-adrenoceptors are also expressed in brain and pineal gland. 

^Adrenoceptor Function 
The P,- and p.-adrenergic receptors are coupled to a Gg-protein complex, 
which activates adenylate cyclase. Agonist binding to the P, -receptor, located in cardiac 
muscle ceils, mediates increased contractility and cardiac output (Lullmann, supra). 
Agonist binding to the Pj-receptor, located in peripheral vascular arteries, mediates 
vasodilation by increasing the amount of cAMP, and thereby inhibiting activation of 
myosin kinase, which is necessary for smooth muscle cell constriction {ibid.) Activation 
of cAMP in cardiac cells by agonist binding to both p,- and p-2-adrenoceptors activates the 
cAMP-dependent protein kinase (PKA) (Castellano and Bohm, Hypertension, 1997, 
29:715-722). P-adrenoceptors also regulate the control of melatonin production in the 
pineal gland, by the cAMP activation of one of the enzymes (5-HT-N-acetyl transferase) 
involved in the synthesis of melatonin (Collins et ai, supra). 
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The Pj-receptors mediate increased conversion of glycogen to glucose 
(glycogenolysis) in both the liver and skeletal muscle (H. Liillmann, et ai, supra), and 
stimulate influx of potassium into muscle cells to prevent hyperkalemia (Berne and Levy, 
supra). 

The P-receptors are regulated on the protein level by desensitization. The 
initial desensitization process results from the phosphorylation of serine and threonine 
residues in the cytoplasmic tail or third intracellular loop by several protein kinases, 
including PARK and PKA (Hieble et ai, supra), PARK phosphorylates specific serine or 
threonine residues in the C-terminal of receptors that are occupied by an agonist. The 
phosphorylation triggers binding of the cytostolic protein P-arrestin and results in the 
uncoupling from 0^.^. PKA is activated by cAMP and phosphorylates the P2-adrenoceptor 
by a relatively slow process. The phosphorylated receptor loses the ability to activate G, 
(Castellano and Bohm, supra). Prolonged interaction of agonists with adrenoceptors 
generally results in receptor desensitization. 

^Adrenoceptor Gene Structure 

The human p, -adrenoceptor gene is located on the long arm of chromosome 
10, the same chromosome as for the ttjA-adrenoceptor gene. The coding sequence of this 
gene is deposited with GenBank, accession number X69168. The regulatory region is also 
deposited with GenBank, accession number J03019. It codes for an intronless gene 
product of 1 43 1 base pairs (Hall, Thorax, 1 996, 5 1 :35 1 -353). Both the promoter and the 
coding region of the gene are rich in G and C residues, which make up greater than 70% of 
the bases. The promoter does not contain any paired consensus TATA box and CAAT 
box elements but instead clusters with an inverted CAAT box and SP, or AP-2 binding 
motifs. This type of receptor, reminiscent of "housekeeping genes", has been described 
for other G-protein coupled receptors as well (Collins et al., supra). 

The human p.-adrenoceptor gene is located on the long arm of chromosome 
5, the same chromosome as the a,y-adrenoceptor gene. The coding sequence has been 
deposited with GenBank, with accession numbers M 1 5 1 69, J02728, or M 1 6 1 06. The 
regulatory region sequence is also deposited with GenBank, accession number Y00106. It 
codes for an intronless gene product of 1239 base pairs (Hall, supra). The promoter region 
is 200-300 bases 5' of the translation initiation codon, and it can form strong secondary 
structures due to high G-C content. There are two TATA boxes (separated by roughly 10 
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bp) and a CAAT box located approximately 30 and 80 base pairs upstream, respectively, 
from the mRNA start region. 



^Adrenoceptor Gene Regulation 

There are some regulatory regions identified in the promoter region of the 
Pi -adrenoceptor gene: a cAMP response element (CRE), a consensus thyroid response 
element (TRE), and a glucocorticoid response element (GRE). This is consistent with the 
evidence that both thyroid hormone and corticosteroids affect adrenergic sensitivity in 
both heart and adipose tissue. The CRE region might have a self-regulatory function, as 
has been shown for the Pj-^drenoceptor gene (Collins et al., supra). 

There are several regulatory domains in the 5' flanking region of the 
Pj-adrenoceptor. Among these is a cAMP-responsive element (CRE), which is recognized 
and stimulated by a phosphoprotein called CRE binding protein (CREB). CREB is 
partially under the control of PKA-dependent phosphorylation processes. This is seen as 
an increase in Pj-adrenoceptor mRNA level in the early phase after exposure to P-agonists. 
However, the level of mRNA is decreased after prolonged exposure to agonists, probably 
mediated by a shortening of mRNA half-life (Castellano and Bohm, supra). It has also 
been shown that transcription of the Pj-adrenoceptor gene is upregulated by stimulation 
with glucocorticoids in a variety of tissues (Collins et al., supra). In the 3' flanking region 
there are sequences homologous to glucocorticoid response elements. These might be 
responsible for the increased expression of P2 adrenoceptor observed in transfected cells 
after treatment with hydrocortisone (Emorine and Marullo, Proc. Natl. Acad. Sci. USA, 
1987, 84:6995-6999). 

^Adrenoceptor Protein Structure 

The proposed model for p-adrenoceptors is like most of the G-protein 
binding receptors, a seven a-helical transmembrane structure, where the seven a-helices 
are radially arranged around a central "pore", in which the receptor ligands bind. The 
P-adrenoceptors have an extracellular glycocylated N-terminus, and an intracellular 
C-terminus. The P, -receptor consists of 477 amino acids; the P^-receptor consists of 413 
amino acids. 

The overall amino acid identity of human P,- and Pj- adrenoceptors is only 
54%. However, it is likely that the pharmacological differences between Pj-receptors and 
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P, -receptors are due to subtle changes in orientation of the primary binding sites, resulting 
in a slightly different binding site rather than to specific amino acid substitutions (Hieble 
et al, J. Med. Chem. 1995, 38:3415-3444). 

Site-directed mutagenesis has demonstrated that an aspartic acid residue, 
Asp-1 13, located in the third transmembrane-spanning helix, and two serine residues, 
Ser-204 and Ser-207, are required for full agonist binding to the Pz-adrenoceptor, The 
P, -adrenoceptor contains identical amino acid residues located in corresponding positions 
to those shown to be important for agonist binding to the P2-adrenoceptor. Another 
aspartic acid residue, Asp-79, located in the second a-helix of both P-receptors is highly 
conserved in G-protein coupled receptors (Hieble et al., supra), Ser-3 19 has a potential 
role in agonist binding to the P;>-adrenoceptor. 

Mutation of Tyr-350, located in the cytoplasmic tail of the Pj-receptor, 
interferes with coupling of the receptor to Gg (Hieble et al, supra). Also, palmitoylation 
of Cys-341 in the C-terminal enables the Pj-adrenoceptor to form a fourth 
intracytoplasmic loop, which increases the ability of the agonist-bound receptor to mediate 
adenylyl cyclase stimulation (Strosberg, Preotein Science, 1993, 2:1 198-1209). 

^Adrenoceptors as Drug Targ ets 
No cause of disease can be identified in 80-90% of patients with 
hypertension. They have so-called essential hypertension, which affects 5-10% of the 
general population, and is the most common cause of disease in developed countries ( J. 
Axford, Medicine, Blackwell Science Ltd., 1996, 10.1 19-10.130). Betablockers have been 
widely used in the treatment of hypertension. They are particularly useful for the 
treatment of juvenile hypertension with tachycardia and high cardiac output. Betablockers 
or beta-adrenergic blockers were first introduced as a treatment for essential hypertension 
in 1964, and are still recommended as first choice because the cost for betablockers is low, 
which improves patient compliance. They act by binding to P, -receptors on the cardiac 
smooth muscle cells, which leads to decreased cardiac output. Most betablockers are not 
specific Pi-receptor antagonists but binds to Pj-receptors as well. The binding to 
Pj-receplors gives the opposite of the desired effect though inhibition of Pj-receptors leads 
to vasoconstriction. This gives a side effect with cold hands and feet because most of the 
Pj-receptors are located in the peripheral vascular arteries. 
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They have also been known to cause bronchospasms as well as some 
central nervous system side effects (nightmares, somnolence). They decrease insulin 
secretion, which makes them inappropriate to treat hypertensives with diabetes mellitus, 
and they can cause heart failure and peripheral artery obstructive disease (Velaseco and 
Rodrigues, Journal of Human Hypertension 10, Suppl. 1, S77-S80, 1996), 

P-adrenoceptor agonists, such as dopamine and dobutamine are used to 
stimulate myocardial Pj -adrenoceptors in the acute management of congestive heart 
failure. They act by increasing contractility and cardiac output. 

A lot of different pj-agonists are used in the treatment of asthma. They 
exert their primary effect on the Pj-adrenergic receptor of bronchial smooth muscle, 
resulting in relaxation and bronchial dilatation. They also protect against 
bronchoconstrictor challenge (Hall, supra). 

Thus, there is a clear need in the art for an improved understanding of the 
effects of betablockers on different subjects, and to predict which patients will have a 
better response to treatment with betablockers. 

Need for Effective Cardiovascular Status Assessment 
The high morbidity and mortality associated with cardiovascular disease 
demonstrate a need in the art for methods and compositions that allow the determination 
and/or prediction of the therapeutic regimen that will result in the most effective treatment 
outcome in a patient suffering from cardiovascular disease. This includes identification of 
individuals who are more or less responsive to particular therapeutic regimens, including, 
e.g., particular drugs that are conventionally used to treat cardiovascular disease. 

Furthermore, the heterogeneity in responses to cardiovascular therapies 
emphasizes a need for another approach to rational drug development. In particular, 
populations that are identified as non-responsive to a particular therapeutic regimen can be 
identified for development and testing of alternative regimens. Thus, effective treatment 
regimens could be developed for a larger percentage of the affected population. 

In summary, there is a need to reduce or eliminate trial and error in 
selecting a therapeutic regimen for a particular individual. It would be desirable instead to 
predict whether a given individual will be responsive to, e.g., a particular class of drugs or 
even to a particular drug or whether he/she is likely to suffer from adverse reactions or 
side-effects. 



wo 00/22166 . «L 3^ . PCT/IB99/01678 

There is also a need in the art for methods and compositions that allow the 
identification of individuals having a predisposition to cardiovascular disease, such as, 
^•Sm myocardial infarction, hypertension, atherosclerosis, and stroke, to facilitate early 
intervention and disease prevention. 

The present invention addresses these and other needs in the art by 
providing polymorphisms and polymorphic patterns that are characteristic of 
cardiovascular status, and by using these polymorphisms and patterns to prescribe or to 
develop more effective treatments or to assist in diagnosis. 

Citation of any reference in this application should not be construed as an 
admission that the reference is prior art to the invention. Each cited reference is 
incorporated herein by reference in its entirety. 

SUMMARY OF THE INVENTION 
In one aspect, the present invention provides reagents and methods for 
predicting whether a particular therapeutic regime (such as a specific drug, a class of drugs 
or any other therapeutic regime, pharmacological or not) would be effective in improving 
a cardiovascular condition in a human individual, or would be ineffective for that purpose, 
or its use would be associated with adverse reactions or undesirable side-effects. 

A particular advantage of the invention is that one or more polymorphic 
markers provide a basis for predicting the outcome of a treatment regimen. By comparing 
a polymorphic pattern of a subject who requires treatment for a cardiovascular disorder, 
for example hypertension, with a reference pattern previously established to correlate with 
responsivity to the treatment regimen, a physician can predict whether a treatment plan, 
such as administration of an ACE inhibitor, is likely or not to be effective before 
subjecting the subject to the treatment plan. For example, a comparison of the test 
polymorphic pattern from an individual with reference polymorphic patterns of individuals 
exhibiting differing responses to a particular therapeutic intervention can be used to 
predict the type or degree of responsivity of the individual to such intervention. The 
present invention thus represents a significant breakthrough in treating cardiovascular 
pathologies in that it reduces or eliminates trial and error in selecting a treatment for a 
particular individual cardiovascular patient. 

An additional advantage of the invention derives from the ability to 
eliminate subjects from clinical trials who are predictably non-responsive, or at risk for an 
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adverse response, to a particular treatment regimen. Furthermore, adverse results in an 
early trial can be evaluated to identify polymorphic patterns, so that the adverse results can 
be correlated with a sub-population of the test population permitting exclusion of such 
sub-population from the treatment group. The invention may thus ensure that a beneficial 
drug can be approved for use in the appropriate population, and decrease the number of 
required patients and therefore the duration and cost of clinical trials. It may also lead to 
identification of another subgroup which can be the target for development of another 
therapeutic regimen. 

All of the foregoing applications within the scope of the invention can be 
deemed to be assessments of an individual's cardiovascular status, as the term is broadly 
defined below. 

The foregoing methods of the invention are carried out by comparing a test 
polymorphic pattern established by at least one polymorphic position within a gene 
encoding ACE, ATI, AGT, renin, aldosterone synthase, type-2 angiotensin II receptor, 
endothelin receptor, and P-adrenoceptor with a polymorphic pattern of a population of 
individuals exhibiting a predetermined responsivity to the regimen (reference pattern). If 
the test pattern matches the reference pattern, there is a statistically significant probability 
that the individual has the same cardiovascular status as that correlated with the reference 
pattern. 

The polymorphic pattern preferably consists of at least two (and more 
preferably at least three) polymorphic positions, at least one of which is in the gene 
encoding a polypeptide from the group consisting of ACE, ATI, AGT, renin, aldosterone 
synthase, type-2 angiotensin II receptor, endothelin receptor, and P-adrenoceptor, and a 
second polymorphism in a gene encoding a polypeptide selected from the group consisting 
of ACE, ATI, AGT, renin, aldosterone synthase, type-2 angiotensin II receptor, endothelin 
receptor, and P-adrenoceptor. 

Additionally, the invention provides methods for assessing whether a 
particular individual has a genetic predisposition to a cardiovascular pathology. This 
aspect of the invention comprises comparing a test polymorphic pattern established by at 
least one and preferably at least two and most preferably at least three polymorphic 
positions within a gene encoding ACE, ATI, AGT, renin, aldosterone synthase, type-2 
angiotensin II receptor, endothelin receptor, and P-adrenoceptor in conjunction with one or 
more, and preferably two or more, other polymorphic positions in ACE, ATI , AGT, renin, 
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aldosterone synthase, type-2 angiotensin II receptor, endothelin receptor, or P- 
adrenoceptor, with a polymorphic pattern of individuals exhibiting a predisposition to a 
cardiovascular syndrome. The conclusion drawn depends on whether the individual's 
polyymorphism pattern matches the reference pattern. 
5 The invention also provides an isolated nucleic acid having a sequence 

corresponding to part or all of the gene encoding ACE, ATI, AGT, renin, aldosterone 
synthase, type-2 angiotensin II receptor, endothelin receptor, and P-adrenoceptor, the 
nucleic acid comprising a polymorphism in the ACE, ATI, AGT, renin, aldosterone 
synthase, type-2 angiotensin II receptor, endothelin receptor, and P-adrenoceptor gene. In 

10 preferred embodiments, the polymorphism, in combination with one or more other 
polymorphisms in the sequence of the same gene or a gene encoding a protein selected 
from the group consisting of human ACE, ATI, AGT, renin, aldosterone synthase, type-2 
angiotensin II receptor, endothelin receptor, and p-adrenoceptor, is predictive of a 
particular type or level of responsivity to a given treatment, or indicates a predisposition to 

15 one or more clinical syndromes associated with cardiovascular disease, or both. 

The isolated polymorphisms according to the invention (which are 
described using the numbering indicated in Table 1 below) include without limitation: 

Nucleic acids encoding renin having one or more polymorphic positions in 
the last exon of the gene (exon 10), which is a cytosine to thymine transition that creates a 

2 0 premature stop codon at position 387. 

Nucleic acids encoding aldosterone synthase promoter at position -344. 
Nucleic acids encoding P-adrenergic receptor- 1 regulator region, at 
positions 2238, 2440, 2493, 2502, 2577, 2585, 2693, 2724, and 2757, as numbered in 
GenBank accession number X69168. In preferred embodiments, the bases at specific 

25 positions are 2238 G, 2238 A, 2577 C, 2257 T, 2757 A, and 2757 G. 

Nucleic acids encoding p-adrenergic receptor- 1 coding region, at positions 
231, 758, 1037, 1251, 1403, and 1528, as numbered in GenBank accession number 
J03019. In preferred embodiments, the bases at specific positions are 231 A, 231 G, 1251 
C, 1251 G, 1403 A, 1403 G, 1528 C, and 1528 A. 

30 Nucleic acids encoding P-adrenergic receptor-2 regulatory region, at 

positions 932, 934, 987, 1006, 1 120, 1221, 1541, and 1568, as numbered in GenBank 
accession number M 1 5 1 69, or J02728, or M 1 6 1 06. In preferred embodiments, the bases 
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at specific positions are 934 A, 934 G, 987 C, 987 G, 1006 A, 1006 G, 1 120 C, 1 120 G, 
1221 C, 1221 T, 1541 C, 1541 T, 1568 C, and 1568 T., 

Nucleic acids encoding P-adrenergic receptor-2 coding region, at positions 
839, 872, 1045, 1284, 1316, 1846, 1891, 2032, 2068, and 2070, as numbered in GenBank 
accession number Y00106. In preferred embodiments, the bases at specific positions are 
839 A, 839 G, 872 C, 872 G, 1045 A, 1045 G, 1284 C, 1284 T, 1316 A, 1316 C, 1846 C, 
1 846 G, 2032 A, 2032 G, 2068 no insert, 2068 G. 2068 C, 2070 no insert, and 2070 C. 

Nucleic acids encoding endothelin-A receptor at positions 969, 1005, 1 146, 
and 2485, as numbered in GenBank accession number S57498. In preferred embodiments, 
the bases at specific positions are 969 C, 969 T, 1005 A, 1005 G, 1 146 A, 1 146 G, 2485 T, 
and 2485 C. 

Nucleic acids comprising polymorphisms present in other genes, which 
can be used in combination with a polymorphism from a gene encoding a polypeptide 
selected from the group consisting of ACE, ATI, AGT, renin, aldosterone synthase, type-2 
angiotensin II receptor, endothelin receptor, and P-adrenoceptor to establish a 
polymorhism pattern, have been disclosed in International Patent Application No. 
PCT/IB98/00475, and include: 

(i) Nucleic acids encoding ACE having one or more polymorphic positions 
at the position in the regulatory region numbered 5106; positions in the coding region 
numbered 375, 582, 731, 1060, 2741,3132, 3387, 3503, and 3906; and position 1451 all 
positions as numbered in GenBank entry X62855. In preferred embodiments, the 
sequences at the polymorphic positions in the ACE regulatory region are one or more of 
5106C and 5106T; and the sequences at the polymorphic positions in the coding region are 
one or more of 375A, 375C, 582C, 582T, 731 A, 73 IG, 1060G, 1060A, 2741G, 2741T, 
3132C, 3132T, 3387T, 3387C, 3503G, 3503C, 3906G, and 3906A. The invention also 
encompasses a nucleic acid encoding a deletion of nucleotides 1451-1783 as numbered in 
GenBank entry X62855. 

(ii) Nucleic acids encoding AGT having one or more polymorphic positions 
at positions in the regulatory region numbered 395, 412, 432, 449, 692, 839, 1007, 1072, 
and 1204; positions in the coding region numbered 273,912,997, 1116, and 1174; and 
position 49 as numbered in GenBank entry M24688. In preferred embodiments, the 
sequences at the polymorphic positions in the AGT regulatory region are one or more of 
395T, 395 A, 412C, 412T, 432G, 432A, 449T, 449C. 692C, 692T, 839G, 839A, 1007G, 
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1007 A, 1072G, 1072 A, 1204C, and 1204A; the sequences at the polymorphic position in 
the coding region are one or more of 273C, 273T. 912C, 912T, 997G, 997C, 1 1 16G, 
1 1 16A, I174C and 1 174A; and the sequence at position 49 in GenBank entry M24688 is 
either A or G. 

(iii) Nucleic acids encoding ATI having one or more polymorphic 
positions at positions in the regulatory region numbered 1427, 1756, 1853, 2046, 2354, 
2355, and 24 1 5; and the position in the coding region numbered 449. In preferred 
embodiments, the sequences at the polymorphic positions in the ATI regulatory region are 
one or more of 1427 A, 1427T, 1756T, 1756A, 1853T, 1853G, 2046T, 2046C, 2354A, 
2354C, 2355G, 2355C, 2415A and 2415G; and the sequences at the polymorphic positions 
in the coding region are one or more of 449G, 449C, 678T, 678C, 1 167 A, 1 167G, 1271 A, 
and 1271C. 

The invention also encompasses libraries of isolated nucleic acid 
sequences, such as arrays on a solid surface, wherein each sequence in the library 
comprises a polymorphic position in the gene encoding ACE, ATI, AGT, renin, 
aldosterone synthase, type-2 angiotensin II receptor, endothelin receptor, or p- 
adrenoceptor and other polymorphic positions in the other genes, including without 
limitation the polymorphic positions and sequences disclosed herein. Also provided are 
nucleic acid probes that hybridize specifically to the identified polymorphic positions; 
peptides and polypeptides comprising polymorphic positions; and polymorphism-specific 
antibodies, Le., sequence-specific antibodies that bind differentially to polymorphic 
variants of the foregoing genes, that can be used to identify particular polymorphic 
variants. 

In another aspect, the invention provides kits for the determination of 
polymorphic patterns in an individual's genes. The kits comprise a means for detecting 
polymorphic sequences, including without limitation oligonucleotide probes that hybridize 
at or adjacent to the polymorphic positions and polymorphism-specific antibodies. 

In yet another aspect, the invention provides nucleic acid and polypeptide 
targets for use in screening methods to identify candidate cardiovascular drugs. Nucleic 
acid targets may be, e.g., DNA or RNA and are preferably at least about 10, and most 
preferably at least about 1 5, residues in length and comprise one or more polymorphic 
positions in a gene encoding a polypeptide from the group consisting of ACE, ATI , AGT, 
renin, aldosterone synthase, type-2 angiotensin II receptor, endothelin receptor, and P- 
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adrenoceptor. Peptide targets are at least about S amino acids in length and may be as 
large or larger than the full-length polypeptides. 



DETAILED DESCRIPTION OF THE INVENTION 

The invention in based, in part, on the discovery that polymorphisms in 
certain genes in the RAAS, endothelin, and p-adrenoceptors pathways define 
polymorphism patterns that correlate with cardiovascular status. Most significantly, by 
comparing a test individual's polymorphism pattern with a reference polymorphism 
pattern, which is a polymorphism pattern from a population of individuals with known 
cardiovascular status, one is able to predict whether the test individual has an increased 
likelihood to share the same cardiovascular status as that correlated with the reference 
polymorphism pattern. In particular, particular patterns correlate with responsiveness to 
ACE inhibitors, non-responsiveness to ACE inhibitors, and predisposition to 
cardiovascular diseases or dysfunctions, including myocardial infarction and stroke. 

The invention provides a powerful predictive tool for clinical testing and 
treatment of cardiovascular disease. For clinical testing, the present invention permits 
smaller, more efficient clinical trials by identifying individuals who are likely to respond 
poorly to a treatment regimen and reducing the amount of uninterpretable data. By 
evaluating a test individual's polymorphism pattern, a physician can prescribe a 
prophylactic or therapeutic regimen customized to that individual's cardiac status. 
Adverse responses to particular therapies can be avoided by excluding those individuals 
whose cardiovascular status puts them at risk for that therapy. Appropriate changes in 
lifestyle, including diet, environmental stress, and exercise levels can be prescribed for 
individuals whose test polymorphic pattern matches a reference pattern that correlates with 
increased predisposition to cardiovascular disease. 

Definitions 

"Cardiovascular status" as used herein refers to the physiological status of 
an individual's cardiovascular system, as reflected in one or more status markers or 
indicators including genotype. Cardiovascular status shall be deemed to include without 
limitation not only the absence or presence of a pathology or disease in one or more 
components of the individual's cardiovascular system and the individual's predisposition 
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to developing such a condition, but also the individual's responsivity, /.e., the ability or 
inability of the individual to respond (positively or negatively) to a particular prophylactic 
or therapeutic regimen or treatment for a cardiovascular condition, such as a drug or a 
class of drugs. A negative response includes one or more adverse reactions and side 
effects. Status markers include without limitation clinical measurements such as, e.g., 
blood pressure, electrocardiographic profile, differentiated blood flow analysis, and the 
presence of increased levels of cellular proteins associated with a cardiovascular event. 
Examples of such proteins, also called diagnostic markers, which are important in cardiac 
events include myosin light chain, myosin heavy chain, myoglobin, troponin 1, troponin T, 
CK-MB, etc. (see U.S. Patents No. 5,604,105 and No. 5,744,358). Status markers 
according to the invention are assessed using conventional methods well known in the art. 
Also included in the evaluation of cardiovascular status are quantitative or qualitative 
changes in status markers with time, such as would be used, e.g., in the determination of 
an individual's response to a particular therapeutic regimen or of a predisposed 
individual's eventual development of a cardiovascular condition. 

Examples of cardiovascular syndromes that are included in the foregoing 
definition of cardiovascular status include diagnosis of, or predisposition to, one or more 
cardiovascular syndromes, such as, e.g., hypertension, acute myocardial infarction, silent 
myocardial infarction, unstable angina, stroke, and atherosclerosis. It will be understood 
that a diagnosis of a cardiovascular syndrome made by a medical practitioner encompasses 
not only clinical measurements but also medical judgment. 

"Responsivity", as used herein, refers to the type and degree of response an 
individual exhibits to a particular therapeutic regimen, /.e., the effect of a treatment on an 
individual. Responsivity breaks down into three major categories: therapeutic effect; no 
effect; and adverse effect. Naturally, there can be differing degrees of a therapeutic effect, 
e.g., between full elimination and partial elimination of symptomology. In addition, 
adverse effects, or side effects, may be observed even though the treatment is beneficial, 
i.e., therapeutically effective. Indeed, the present invention may permit identification of 
individuals with complex responsivity traits or patterns. 

A "predisposition to develop a cardiovascular syndrome" refers to an 
increased likelihood, relative to the general population, to develop a cardiovascular 
syndrome, as defined above. A predisposition does not signify certainty, and development 
of the syndrome may be forestalled or prevented by prophylaxis, e.g., adopting a modified 
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diet, exercise program, or treatment with gene therapy or pharmaceuticals. Naturally, an 
advantage of the present invention is that it permits identification of individuals who are, 
based on their genotype, predisposed to develop a cardiovascular syndrome, and for whom 
prophylactic intervention can be especially important. 

A "polymorphism" as used herein denotes a variation in the nucleotide 
sequence of a gene in an individual. Genes that have different nucleotide sequences as a 
result of a polymorphism are "alleles". A "polymorphic position" is a predetermined 
nucleotide position within the sequence. In some cases, genetic polymorphisms are 
reflected by an amino acid sequence variation, and thus a polymorphic position can result 
in location of a polymorphism in the amino acid sequence at a predetermined position in 
the sequence of a polypeptide. An individual "homozygous" for a particular 
polymorphism is one in which both copies of the gene contain the same sequence at the 
polymorphic position. An individual "heterozygous" for a particular polymorphism is one 
in which the two copies of the gene contain different sequences at the polymorphic 
position. 

A "polymorphism pattern" as used herein denotes a set of one or more 
polymorphisms, including without limitation single nucleotide polymorphisms, which 
may be contained in the sequence of a single gene or a plurality of genes. In the simplest 
case, a polymorphism pattern can consist of a single nucleotide polymorphism in only one 
position of one of two alleles of an individual . However, one has to look at both copies of 
a gene. A polymorphism pattern that is appropriate for assessing a particular aspect of 
cardiovascular status (e.g., predisposition to hypertension) need not contain the same 
number (nor identity, of course) of polymorphisms as a polymorphism pattern that would 
be appropriate for assessing another aspect of cardiovascular status (e.g., responsivity to 
ACE inhibitors for control of hypertension). A "test polymorphism pattern" as used herein 
is a polymorphism pattern determined for a human subject of undefined cardiovascular 
status. A "reference polymorphism pattern" as used herein is determined from a 
statistically significant correlation of patterns in a population of individuals with pre- 
determined cardiovascular status. 

"Nucleic acid" or "polynucleotide" as used herein refers to purine- and 
pyrimidine-containing polymers of any length, either polyribonucleotides or 
polydeoxyribonucleotides or mixed polyribo-polydeoxyribo nucleotides. Nucleic acids 
include without limitation single- and double-stranded molecules, i.e., DNA-DNA, DNA- 
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RNA and RNA-RNA hybrids, as well as "protein nucleic acids" (PNA) formed by 
conjugating bases to an amino acid backbone. This also includes nucleic acids containing 
modified bases and non-naturally occurring phosphoester analog bonds, such as 
phosphorothioates and thioesters. The term nucleic acid molecule, and in particular DNA 
or RNA molecule, refers only to the primary and secondary structure of the molecule, and 
does not limit it to any particular tertiary fornis. Thus, this term includes double-stranded 
DNA found, inter alia, in linear or circular DNA molecules (e.g., restriction fragments), 
plasmids, and chromosomes. In discussing the structure of particular double-stranded 
DNA molecules, sequences may be described herein according to the normal convention 
of giving only the sequence in the 5' to 3* direction along the nontranscribed strand of 
DNA {i.e., the strand having a sequence homologous to the mRNA). A "recombinant 
DNA molecule" is a DNA molecule that has undergone a molecular biological 
manipulation. 

As used herein, the term "oligonucleotide" refers to a nucleic acid, 
generally of at least 10, preferably at least 15, and more preferably at least 20 nucleotides, 
that is hybridizable to a genomic DNA molecule, a cDNA molecule, or an mRNA 
molecule encoding a gene, cDNA, mRNA, or other nucleic acid of interest. 
Oligonucleotides can be labeled, e.g., with "P-nucleotides or nucleotides to which a label, 
such as biotin, has been covalently conjugated. In one embodiment, a labeled 
oligonucleotide can be used as a probe to detect the presence of a nucleic acid. In another 
embodiment, oligonucleotides (one or both of which may be labeled) can be used as PGR 
primers, either for cloning full length or a fragment of a gene of interest, or to detect the 
presence of nucleic acids encoding the gene of interest. In a further embodiment, an 
oligonucleotide of the invention can form a triple helix with a double stranded sequence of 
interest in a DNA molecule. In still another embodiment, a library of oligonucleotides 
arranged on a solid support, such as a silicon wafer or chip, can be used to detect various 
polymorphisms of interest. Generally, oligonucleotides are prepared synthetically, 
preferably on a nucleic acid synthesizer. Accordingly, oligonucleotides can be prepared 
with non-naturally occurring phosphoester analog bonds, such as thioester bonds. 

An "isolated" nucleic acid or polypeptide as used herein refers to a nucleic 
acid or polypeptide that is removed from its original environment (for example, its natural 
environment if it is naturally occurring). An isolated nucleic acid or polypeptide contains 
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less than about 50%, preferably less than about 75%, and most preferably less than about 
90%, of the cellular components with which it was originally associated, 

A nucleic acid or polypeptide sequence, that is "derived from" a designated 
sequence refers to a sequence that corresponds to a region of the designated sequence. For 
nucleic acid sequences, this encompasses sequences that are identical to or complementary 
to the sequence. 

A "probe" refers to a nucleic acid or oligonucleotide that forms a hybrid 
structure with a sequence in a target nucleic acid due to complementarity of at least one 
sequence in the probe with a sequence in the target nucleic acid. Generally, a probe is 
labeled so it can be detected after hybridization. 

A nucleic acid molecule is "hybridizable" to another nucleic acid molecule, 
such as a cDNA, genomic DNA, or RNA, when a single stranded form of the nucleic acid 
molecule can anneal to the other nucleic acid molecule under the appropriate conditions of 
temperature and solution ionic strength {see Sambrook et aL, 1989, Molecular Cloning: A 
Laboratory Manual^ Second Edition, Cold Spring Harbor Laboratory Press: Cold Spring 
Harbor, New York). The conditions of temperature and ionic strength determine the 
"stringency" of the hybridization. For preliminary screening for homologous nucleic 
acids, low stringency hybridization conditions, corresponding to a of 55°C, can be 
used, e.g., 5x SSC, 0. 1% SDS, 0.25% milk, and no formamide; or 30% formamide, 5x 
SSC, 0.5% SDS). Moderate stringency hybridization conditions correspond to a higher 
T^, e.g., 40% formamide, with 5x or 6x SCC. High stringency hybridization conditions 
correspond to the highest T^, e.g., 50% formamide, 5x or 6x SCC. Hybridization requires 
that the two nucleic acids contain complementary sequences, although depending on the 
stringency of the hybridization, mismatches between bases are possible. The appropriate 
stringency for hybridizing nucleic acids depends on the length of the nucleic acids and the 
degree of complementation, variables well known in the art. The greater the degree of 
similarity or homology between two nucleotide sequences, the greater the value of T^ for 
hybrids of nucleic acids having those sequences. The relative stability (corresponding to 
higher TJ of nucleic acid hybridizations decreases in the following order: RNA:RNA, 
DNAiRNA, DNAiDNA. For hybrids of greater than 100 nucleotides in length, equations 
for calculating T^, have been derived (see Sambrook et ai, supra, 9.50-9.51). For 
hybridization with shorter nucleic acids, /.e., oligonucleotides, the position of mismatches 
becomes more important, and the length of the oligonucleotide determines its specificity 
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{see Sambrook et ai, supra, 1 1 .7-1 1.8). A minimum length for a hybridizable nucleic acid 
is at least about 10 nucleotides; preferably at least about 15 nucleotides; and more 
preferably the length is at least about 20 nucleotides. . 

In a specific embodiment, the term "standard hybridization conditions" 
5 refers to a T„, of 55°C, and utilizes conditions as set forth above. In a preferred 
embodiment, the is 60''C; in a more preferred embodiment, the is 65°C. In a 
specific embodiment, "high stringency" refers to hybridization and/or washing conditions 
at 68 °C in 0.2XSSC, at 42 °C in 50% formamide, 4XSSC, or under conditions that afford 
levels of hybridization equivalent to those observed under either of these two conditions. 

10 A "gene" for a particular protein as used herein refers to a contiguous 

nucleic acid sequence corresponding to a sequence present in a genome which comprises 
(i) a "coding (or transcribed) region," which comprises exons (i.e., sequences encoding a 
polypeptide sequence, or "protein-coding" or "transcribed sequences"), introns, sequences 
at the junction between exons and introns, and 5' and 3* untranslated regions (uTRs); and 

15 (ii) regulatory sequences, which flank the coding region at both 5* and 3* termini. For 
example, the "ACE gene" as used herein encompasses the regulatory and coding regions 
of the human gene encoding angiotensin converting enzyme. Similarly, the "AGT gene" 
encompasses regulatory and coding regions of the human gene encoding angiotensinogen 
and the "ATI gene" encompasses regulatory and coding regions of the human gene 

2 0 encoding type I angiotensin II receptor. Typically, regulatory sequences according to the 
invention are located 5' (i.e., upstream) of the coding region segment. The reference 
sequences, obtained from GenBank, which were used in practicing the present invention 
are shown in Table 1. 
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Table 1. GenBank Accession Numbers 





Abbreviation 


Compared master 
sequence 


Numbering according to 
sequence entry in GenBank 


SEQ 

ID 

NO. 


5 


ACT Regulatory 
Regions 


XI 5323 


XI 5323 


122 




AGT Coding 
Region 


M24686 (exon 2) 
M24687 (exon 3) 
M24688 (exon 4) 
iVI24689 (exon 5) 

X62855 (intron 16) 


Protein-coding sequences from 
exon 2-5 were spliced together 
as described in the GenBank 
entries. 

Nucleotide 1 is assigned to the 
first nucleotide of the initiator 
methionine codon. 

X62855 


123 
124 
125 
126 

1 T7 




ACE Regulatory 
Region 


X94359 


X94359 




10 


ACE Coding 
Region 


J04144 


J04144 

Nucleotide 1 is assigned to the 
first nucleotide of the initiator 
methionine codon. 


129 




ATI Regulatory 
Region 


U07144 


U07144 


130 


15 


ATI Coding 
Region 


S80239 (exon 3) 
S77410(exon 5) 


The protein-coding sequence of 
S80239 was spliced to position 
288 of entry S77410. 
Nucleotide 1 is assigned to the 
first nucleotide of the initiator 
methionine codon in entry 
S80239. 


131 
132 




Renin 




M10030, X34914, X01391. 
X01734 


133 




Aldosterone 
synthase 




D13752 


134 




AT2 




U 10273 


135 


20 


P-adrenoceptorl 
Regulatory Region 
(BIP) 




X69168/g28421 


136 


25 


P-adrenoceptor1 
Coding Region 
(BIR) 




J03019/gl7899 


137 
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P-adrenoceptor2 
Regulatory Region 
(B2P) 




M1S169 

J02728 
M16106 


138 


P-adrenoceptor2 
Coding Region 
(B2R) 




Y00106/g29370 


139 


EndothelinA 
(ET^) Receptor 
Coding 




S57498 


140 



The present inventors have surprisingly and unexpectedly discovered the 
existence of genetic polymorphisms within the human gene encoding ACE, ATI, AGT, 
renin, aldosterone synthase, type-2 angiotensin II receptor, endothelin receptor, or P- 
adrenoceptor which, singly or in combination, can be used to assess cardiovascular status, 
depending on which component of cardiovascular status is under evaluation. In 
accordance with the invention, the polymorphic pattern of the gene, alone or in 
combination with other genes encoding renin, ACE, AGT, ATI, AT2, aldosterone 
synthase, endothelin receptor, and P-adrenergic receptors 1 and 2 in an individual can 
predict the responsivity of the individual to particular therapeutic interventions and serve 
as an indicator of predisposition to various forms of cardiovascular disease. The invention 
provides methods for assessing cardiovascular status by detecting polymorphic patterns in 
an individual. The present invention also provides isolated nucleic acids derived from the 
gene which comprise these polymorphisms, including probes which hybridize specifically 
to polymorphic positions and primers that amplify the region of the gene in which the 
polymorphism is located; isolated polypeptides and peptides comprising polymorphic 
residues; and antibodies which specifically recognize ACE, ATI, AGT, renin, aldosterone 
synthase, type-2 angiotensin II receptor, endothelin receptor, or p-adrenoceptor 
polypeptides containing one or more polymorphic amino acids. 

Methods for Assessing Cardiovascular Status 

The present invention provides diagnostic methods for assessing 
cardiovascular status in a human individual. The methods are carried out by comparing a 
polymorphic position or pattern ("test polymorphic pattern") within the individual's gene 
encoding ACE, ATI, AGT, renin, aldosterone synthase, type-2 angiotensin II receptor, 
endothelin receptor, or p-adrenoceptor with the polymorphic patterns of humans 
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exhibiting a predetermined cardiovascular status ("reference polymorphic pattern"). If the 
cardiovascular status is the prediction of responsivity to a therapy, a single polymorphic 
position can provide a pattern for comparison. However, it is preferable to use more than 
one polymorphic position for the pattern to improve the accuracy of the prediction. If the 
cardiovascular status is predisposition to a cardiovascular syndrome, at least two, and 
preferably at least three, polymorphic positions are used to make the pattern. In addition, 
other polymorphisms in genes encoding angiotensin converting enzyme (ACE), 
angiotensinogen (ACT), type 1 angiotensin II receptor (ATI), type 2 angiotensin II 
receptor, renin, aldosterone synthase, endothelin, receptor or P-adrenergic receptors 1 and 
2 can be used to establish a polymorphic pattern for the individual. 

For any meaningful prediction, the polymorphic pattern of the individual is 
identical to the polymorphic pattern of individuals who exhibit particular status markers, 
cardiovascular syndromes, and/or particular patterns of response to therapeutic 
interventions. 

In one embodiment, the method involves comparing an individual's test 
polymorphic pattern with reference polymorphic patterns of individuals who have been 
shown to respond positively or negatively to a particular therapeutic regimen. Therapeutic 
regimen as used herein refers to treatments aimed at the elimination or amelioration of 
symptoms and events associated cardiovascular disease. Such treatments include without 
limitation one or more of alteration in diet, lifestyle, and exercise regimen; invasive and 
noninvasive surgical techniques such as atherectomy, angioplasty, and coronary bypass 
surgery; and pharmaceutical interventions, such as administration of ACE inhibitors, 
angiotensin II receptor antagonists, diuretics, alpha-adrenoreceptor antagonists, cardiac 
glycosides, phosphodiesterase inhibitors, beta-adrenoreceptor antagonists, calcium channel 
blockers, HMG-CoA reductase inhibitors, imidazoline receptor blockers, endothelin 
receptor blockers, and organic nitrites. Interventions with pharmaceutical agents not yet 
known whose activity correlates with particular polymorphic patterns associated with 
cardiovascular disease are also encompassed. The present inventors have discovered that 
particular polymorphic patterns correlate with an individual's responsivity to ACE 
inhibitors (see, e.g.. Example 3 below). It is contemplated, for example, that patients who 
are candidates for a particular therapeutic regimen will be screened for polymorphic 
patterns that correlate with responsivity to that particular regimen. 
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In another embodiment, the method involves comparing an individual's 
polymorphic pattern with polymorphic patterns of individuals who exhibit or have 
exhibited one or more markers of cardiovascular disease, such as, e.g., high blood 
pressure, abnormal electrocardiographic profile, myocardial infarction, unstable angina, 
stroke, or atherosclerosis (see, e.g., Example 2 below) and drawing analogous conclusions 
as to the individual's responsivity to therapy, predisposition to developing a syndrome, 
etc., as detailed above. 

Identification of Polymorphic Patterns 
In practicing the methods of the invention, an individual's polymorphic 
pattern can be established e.g. by obtaining DNA from the individual and determining the 
sequence at a predetermined polymorphic position or positions in a gene, or more than one 
gene. 

The DNA may be obtained from any cell source. Non-limiting examples of 
cell sources available in clinical practice include without limitation blood cells, buccal 
cells, cervicovaginal cells, epithelial cells from urine, fetal cells, or any cells present in 
tissue obtained by biopsy. Cells may also be obtained from body fluids, including without 
limitation blood, saliva, sweat, urine, cerebrospinal fluid, feces, and tissue exudates at the 
site of infection or inflammation. DNA is extracted from the cell source or body fluid 
using any of the numerous methods that are standard in the art. It will be understood that 
the particular method used to extract DNA will depend on the nature of the source. 

Determination of the sequence of the extracted DNA at polymorphic 
positions is achieved by any means known in the art, including but not limited to direct 
sequencing, hybridization with allele-specific oligonucleotides, allele-specific PGR, 
ligase-PCR, HOT cleavage, denaturing gradient gel electrophoresis (DGGE), and single- 
stranded conformational polymorphism (SSCP). Direct sequencing may be accomplished 
by any method, including without limitation chemical sequencing, using the Maxam- 
Gilbert method; by enzymatic sequencing, using the Sanger method; mass spectrometry 
sequencing; and sequencing using a chip-based technology. See, e.g.. Little et al. Genet. 
Anal., 1996, 6:151. Preferably, DNA from a subject is first subjected to amplification by 
polymerase chain reaction (PGR) using specific amplification primers. 

In an alternate embodiment, biopsy tissue is obtained from a subject. 
Antibodies that are capable of distinguishing between different polymorphic forms of 
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ACE, ATI, AGT, renin, aldosterone synthase, type-2 angiotensin II receptor, endothelin 
receptor, or P-adrenoceptor are then applied to samples of the tissue to determine the 
presence or absence of a polymorphic form specified by the antibody. The antibodies may 
be polyclonal or monoclonal, preferably monoclonal. Measurement of specific antibody 
binding to cells may be accomplished by any known method, e.g., quantitative flow 
cytometry, or enzyme-linked or fluorescence-linked immunoassay. The presence or 
absence of a particular polymorphism or polymorphic pattern, and its allelic distribution 
(i.e., homozygosity vs. heterozygosity) is determined by comparing the values obtained 
from a patient with norms established from populations of patients having known 
polymorphic patterns. 

In another alternate embodiment, RNA is isolated from biopsy tissue using 
standard methods well known to those of ordinary skill in the art such as guanidium 
thiocyanate-phenol-chloroform extraction (Chomocyznski et aL, Anal. Biochem., 1987, 
162:156). The isolated RNA is then subjected to coupled reverse transcription and 
amplification by polymerase chain reaction (RT-PCR), using specific oligonucleotide 
primers that are specific for a selected polymorphism. Conditions for primer annealing are 
chosen to ensure specific reverse transcription and amplification; thus, the appearance of 
an amplification product is diagnostic of the presence of a particular polymorphism. In 
another embodiment, RNA is reverse-transcribed and amplified, after which the amplified 
sequences are identified by, e.g., direct sequencing. In still another embodiment, cDNA 
obtained from the RNA can be cloned and sequenced to identify a polymorphism. 

Establishing Reference Polymorphism Patterns 
In practicing the present invention, the distribution of polymorphic patterns 
in a large number of individuals exhibiting particular cardiovascular status is determined 
by any of the methods described above, and compared with the distribution of 
polymorphic patterns in patients that have been matched for age, ethnic origin, and/or any 
other statistically or medically relevant parameters, who exhibit quantitatively or 
qualitatively different cardiovascular status. Correlations are achieved using any method 
known in the art, including nominal logistic regression or standard least squares regression 
analysis. In this manner, it is possible to establish statistically significant correlations 
between particular polymorphic patterns and particular cardiovascular statuses. It is 
further possible to establish statistically significant correlations between particular 
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polymorphic patterns and changes in cardiovascular status such as, would result, e.g., from 
particular treatment regimens. Thus, it is possible to correlate polymorphic patterns with 
responsivity to particular treatments. 

A statistically significant correlation preferably has a "p" value of less than 
or equal to 0.05. Any standard statistical method can be used to calculate these values, 
such as the normal Student's T Test, or Fischer's Exact Test. 

The identity and number of polymorphisms to be included in a reference 
pattern depends not only on the prevalence of a polymorphism and its predictive value for 
the particular use, but also on the value of the use and its requirement for accuracy of 
prediction. The greater the predictive value of a polymorphism, the lower the need for 
inclusion of more than one polymorphism in the reference pattern. However, if a 
polymorphism is very rare, then its absence from an individual's pattern might provide no 
indication as to whether the individual has a particular status. Under these circumstances, 
it might be advisable to select instead two or more polymorphisms which are more 
prevalent. Even if none of them has a high predictive value on its own, the presence of 
both (or all three) of them might be sufficiently predictive for the particular purpose. 

If for example the use for a reference pattern is prediction of response to a 
drug, and among the afflicted population only a 30% response to the drug is observed, the 
reference pattern need only permit selection of a population that improves the response 
rate by 10% to provide a significant improvement in the state of the art. On the other 
hand, if the use for the reference pattern is selection of subjects for a particular clinical 
study, the pattern should be as selective as possible and should therefore include a 
plurality of polymorphisms that together provide a high predictive accuracy for the 
intended response. 

In establishing reference polymorphism patterns, it is desirable to use a 
defined population. For example, tissue libraries collected and maintained by state or 
national departments of health can provide a valuable resource, since genotypes 
determined from these samples can be matched with medical history, and particularly 
cardiovascular status, of the individual. Such tissue libraries are found, for example, in 
Sweden, Iceland, Norway, and Finland. As can be readily understood by one of ordinary 
skill in the art, specific polymorphisms may be associated with a closely linked 
population. However, other polymorphisms in the same gene may correlate with 
cardiovascular status of other genetically related populations. Thus, in addition to the 
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specific polymorphisms provided in the instant application, the invention identifies genes 
in which any polymorphisms can be used to establish reference and test polymorphism 
patterns for evaluating cardiovascular status of individuals in the population. 

In a specific embodiment, DNA samples can be obtained form a well 
defined population, such as 277 Caucasian males bom in Uppsala, Sweden between 1920 
and 1924. In a specific embodiment, such individuals are selected for the test population 
based on their medical history, /.e, they were either (i) healthy, with no signs of 
cardiovascular disease (100); or (ii) had suffered one of acute myocardial infarction (68), 
silent myocardial infarction (34), stroke (18), stroke and acute myocardial infarction (19), 
or high blood pressure at age 50 (39), DNA samples are obtained from each individual. 

In a specific embodiment, DNA sequence analysis can be carried out by: 
(i) amplifying short fragments of each of the genes using polymerase chain reaction (PGR) 
and (ii) sequencing the amplified fragments. The sequences obtained from each individual 
can then be compared with the first known sequences, e.g., as set forth in Table 1, to 
identify polymorphic positions. 

Comparing Test Patterns to Reference Patterns 
As noted above, the test pattern from an individual can be compared to a 
reference pattern established for a predetermined cardiovascular status. Identity between 
the test pattern and the reference pattern means that the tested individual has a probability 
of having the same cardiovascular status as that represented by the reference pattern. As 
discussed above, this probability depends on the prevalence of the polymorphism and the 
statistical significance of its correlation with a cardiovascular status. 

Polymorphic Positions 

Polymorphic positions in the genes encoding ACE, ATI, AGT, renin, 
aldosterone synthase, type-2 angiotensin II receptor, endothelin receptor, or P- 
adrenoceptor which are encompassed by the invention can be identified by determining the 
DNA sequence of all or part of the gene in a multiplicity of individuals in a population. 
DNA sequence determination may be achieved using any conventional method, including, 
e.g., chemical or enzymatic sequencing. 
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The polymorphic positions of genes for use in the invention include without 
limitation those listed below, whose numbering corresponds to the GenBank sequences 
listed in Table 1. 

(i) ACE: positions in the regulatory region (designated ACR) 
5 numbered 5 106, 5349, and 5496; positions in the coding region (designated ACE) 

numbered 375, 582, 731, 1060, 1215, 2193, 2328, 2741, 3132, 3387, 3503, and 3906; and 
position 145 1 as numbered in GenBank entry X62855. 

(ii) AGT: positions in the regulatory region (designated AGR) 
numbered 395, 412, 432, 449, 692, 839, 1007, 1072, 1204, and 1218; positions in the 

10 coding region (designated AGT) numbered 273, 620, 803, 912, 997, 1116, and 1 174; and 
position 49 as numbered in GenBank entry M24688. 

(iii) ATI : positions in the regulatory region (designated ATR) numbered 
1427, 1756, 1853, 2046, 2354, 2355, and 2415; and positions in the coding region 
(designated ATI) numbered 449, 678, 1 167, and 1271. 

15 (iv) Renin: A mutant renin gene in familial elevation of prorenin, a 

point mutation in the last exon of the gene (exon 10), has been identified (Villard et ai, J. 
Biol. Chem., 1994, 269:30307-12). A cytosine to thymine transition creates a premature 
stop codon at position 387 resulting in a truncated form of renin with 20 amino acids 
deleted from the carboxyl terminus. 

2 0 (v) Aldosterone synthase: A position in the promoter region of 

aldosterone synthase, position -344 (with the initiation codon starting at 1) has been 
reported by Cambien et al at the International Meeting on Hypertension held in 
Amsterdam in June 1998. 

P-adrenergic receptor- 1, positions in the regulatory region (designated BPl) 

25 numbered 2238, 2440, 2493, 2502, 2577, 2585, 2693, 2724, and 2757; and positions in the 
coding region (designated BRl) numbered 231, 758, 1037, 1251, 1403, and 1528. 

P-adrenergic receptor-2, positions in the regulatory region (designated B2P) 
numbered 932, 934, 987, 1006, 1120, 1221, 1541, and 1568; and positions in the coding 
region (designated B2R) numbered 839, 872, 1045, 1284, 1316, 1846, 1891, 2032, 2068, 

30 and 2070. 

Endothelin receptor type A coding region (designated ET^) numbered 969, 
1005, 1146, and 2485. 
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In preferred embodiments, the base at each of the above polymorphic 
positions is one of: 

(i) ACE Regulatory Region: 5106C, 5106T, 5349A. 5349T, 5496T, 

and 5496C; 

5 (ii) ACE Coding Region: 375A, 375C, 582C, 582T, 731 A, 73 IG, 

1060G, 1060A, 1215C, 1215T, 2193G, 2193A, 2328A, 2328G, 2741G, 2741T, 3132C, 
3132T, 3387T. 3387C, 3503G, 3503C, 3906G, and 3906A; and a deletion of nucleotides 
1451-1783 as numbered in GenBank entry X62855; 

(iii) AGT Regulatory Region: 395T, 395A, 412C, 412T, 432G, 432A, 
10 449T, 449C, 692C, 692T, 839G, 839A. 1007G, 1007A, 1072G. 1072A, 1204C, 1204A, 

1218A, 1218G; 

(iv) AGT Coding Region: 273C, 273T, 620C, 620T, 803T, 803C, 9 1 2C, 
91 2T, 997G, 997C, 1 116G, 1 116A, 1 174C, and 1 174A; and A or G at position 49 in 
GenBank entry M24688; 

15 (v) ATI Regulatory Region: 1427A, 1427T, 1756T, 1756A, 1 853T, 

1853G, 2046T, 2046C, 2354A, 2354C, 2355G, 2355C, 2415A and 2415G; and • 

(vi) ATI Coding Region: 449G, 449C, 678T, 678C, 1 167A, 1 167G, 
1271A, and 1271C. 

(vii) P-adrenergic receptor-l regulatory region: 2238 G, 2238 A, 2577 C, 
20 2257 T, 2757 A, and 2757 G. 

(viii) P-adrenergic receptor-l coding region: 23 1 A, 231 G, 758 C, 758 T, 
1251 C, 1251 G, 1403 A, 1403 G, 1528 C, and 1528 A. 

(ix) p-adrenergic receptor-2 regulatory region: 934 A, 934 G, 987 C, 
987 G, 1006 A, 1006 G, 1120 C, 1120G, 1221 C, 1221 T, 1541 C, 1541 T, 1568 C, and 

25 1568 T. 

(x) P-adrenergic receptor-2 coding region: 839 A, 839 G, 872 C, 872 G, 
1045A, 1045 G, 1284 C, 1284 T, 1316 A, 1316 C, 1846 C, 1846 G. 2032 A, 2032 G.2068 
no insert, 2068 G, 2068 C, 2070 no insert, and 2070 C. 

(xi) Endothelin receptor type A: 969 C, 969 T, 1005 A, 1005 G, 1 146 
30 A, 1 146 G, 2485 T, and 2485 C. 

An individual may be homozygous or heterozygous for a particular 
polymorphic position (see, e.g.. Table 6 below). 
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Non-limiting examples of polymorphic patterns comprising one or more 
polymorphism in ACE, AGT, and/or ATI genes according to the invention include the 
following, which were correlated with an increased incidence of clinical signs of 
cardiovascular disease: 

5 ACR 5349 A/T, AGR 1218 A; ACR 5496 C, AGR 1204 A/C; ACR 5496 

C/T, AGR 1218 A, AGT 620 C/T; ACE 2193 A, AGR 1204 C, ACE 2328 G; ACE 2193 
A, AGR 1204 A/C; ACE 3387 T, AGR 1218 A; ACE 3387 T, AGT 620 C/T; AGR 1204 
A/C, ATI 678 C/T; AGR 1204 A/C, ATI 1271 A/C; ACE 1215 C, AGR 1204 A/C; AGR 
1204 A/C, ATI 1 167 A, ACE 3906 AJG; AGR 1204 A, AGT 620 C, ATI 1271 A, ATI 

10 1 167 A, AGR 395 A/T; AGR 1204 A/C, AGT 620 C/T, ATI 1271 A/C, ATI 1167 A, 

AGR 395 T; AGR 1204 A/C, AGT 620 C/T, ATI 1271 A/C, ATI 1 167 A/G, AGR 395 T; 
AGR 1204 A, ATI 678 C, ATI 1 167 A, AGR 395 A/T; AGR 1204 A/C, ATI 678 C/T, 
ATI 1 167 A, AGR 395 T; AGT 620 C/T, ATI 1271 A/C, ATI 1 167 A, AGR 395 T; AGT 
620 C/T. ATI 1271 A/C, ATI 1167 A/G, AGR 395 T; AGT 620 C, ATI 1271 A, ATI 

15 1167 A, AGR 395 A/T; AGT 620 C, ATI 678 A, ATI 1 167 A, AGR 395 A/T; AGT 620 
C/T, ATI 678 C/T; ATI 1 167 A, AGR 395 T; ACE 2193 A. AGR 1218 A, AGT 803 A; 
ACE 2193 A, AGT 620 C/T; ACE 2328 G, AGT 620 C/T; ACE 3387 T, AGR 1204 A/C; 
ACE 2193 A, ACE 2328 G, AGR 1204 C; ACE 2193 A/G, AGR 1072 G/G. ATI 1 167 
A/A. Additional polymorphism patterns are shown in the Tables in Examples 4 and 5, 

20 below. 

Polvmorpltism Patterns Correlated With ACE Inhibitor Responsiveness 
The following table lists a set of polymorphism patterns that have been 
found to correlate with responsiveness to ACE inhibitor treatment: 
25 Response to ACE-Inhibitor Treatment 



Position 1 


Genotype 1 


Position 2 


Genotype 2 


Position 3 


Genotype 3 


ACE:2193: 


A/G 


AGR: 1072: 


G/G 


ATI: 1167: 


A/A 


ACE:2193: 


A/G 


AGR: 1072: 


G/G 


ACE: 1060: 


G/G 


ATR:2354: 


A/A 


ATl:678: 


C/T 


ATI: 1167: 


A/A 


ACR:5496: 


C/T 


AGR: 1204: 


A/A 


AGR:839: 


G/G 


ACR:5496: 


C/T 


AGR: 1204: 


A/A 


AGT:620: 


C/C 


ACR:5496: 


C/T 


ACE:1060: 


G/G 


AGR:449: 


C/T 
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Position 1 


Genotype 1 


Position 2 


Genotype 2 


Position 3 


Genotype 3 


ACR:5496: 


C/T 


ACE:1215: 


C/T 


AGR:1204: 


AJA 


ACR:5496: 


C/T 


ACE:3906: 


A/G ■ 


AGR:1218: 


A/G 


ACR:5496: 


cn 


AGR:449: 


C/T 


AGR:839: 


G/G 


ACR:5496: 


C/T 


AGR:449: 


C/T 


AT1:1271: 


AJA 


ACR:5496: 


C/T 


AGR:449: 


C/T 


AGR:1072: 


G/G 


ACR:5496: 


C/T 


AGR:1072: 


G/G 


AGR:1204: 


AJA 


ACE: 1060: 


G/G 


AGR:449: 


C/T 


AGT:620: 


C/C 


ACE: 1060: 


G/G 


AGR:1007: 


G/G 


AT1:678: 


C/T 


ACE: 121 5: 


C/T 


ACE:3906: 


A/G 


AGR:1218: 


A/G 


AGR:449: 


C/T 


AGR:1204: 


AJA 


AGT:620: 


C/C 


AGR:449: 


C/T 


AGR:1204: 


A/A 


AGT:1116: 


G/G 


AGR:449: 


C/T 


AGR:839: 


G/G 


AGT:620: 


C/C 


AGR:449: 


C/T 


AGT:620: 


C/C 


ACT: 1116: 


G/G 


AGR:449: 


C/T 


AGT:1116: 


G/G 


ATI: 1271: 


AJA 


AGR:449: 


C/T 


AGR:1072: 


G/G 


AGT:620: 


C/C 


AGR:1007: 


G/C 


AGR:1072: 


G/G 


AT1:678: 


C/T 


AGR:1072: 


G/G 


AGR:1204: 


AJA 


AGT:1116: 


G/G 


ACR:5496: 


cn 


AGR:1204: 


AJA 


ATI: 1167: 


AJA 


AGR:1007: 


G/G 


AGR:1204: 


AJA 


AGT:1 lie- 


G/G 



Polymorphism Patterns of Correlated With ACE Inhibitor Non-responsiveness 
The following table lists a set of polymorphism patterns that have been 
found to correlate with non-responsiveness to ACE inhibitor treatment: 
Non-Response to ACE-Inhibitor Treatment 



Position 1 


Genotype 1 


Position 2 


Genotype 2 


Position 3 


Genotype 3 


ACE: 1060: 


G/G 


AGR:1204: 


A/C 


AGT;620: 


C/C 


ACE:1215: 


C/T 


ACE:2193: 


A/G 


AGR:1204 


A/C 


ACE.2193: 


A/G 


ATR:2046: 


C/T 


AT1:678: 


C/C 
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Position 1 


Genotype 1 


Position 2 


Genotype 2 


Position 3 


Genotype 3 


ACE:2193: 


A/G 


AT1:678: 


C/C 


AT1:1271: 


A/C 



Polymorphism Patterns Correlated With Predisposition to MI 
The following table lists a set of polymorphism patterns that have been 
found to correlate with predisposition to myocardial infarction: 

Predisposition to MI 



Position 1 


Genotype 1 


Position 2 


Genotype 2 


Position 3 


Genotype 3 


ACE;2193: 


AJA 


AGR:1204: 


A/C 






AGR:449: 


T/T 


AGR:1204: 


A/C 


AT1:1271: 


A/C 


ACE:2193: 


A/G 


AGR:620: 


C/T 


AGR:1116 


G/G 


ACE:2193: 


A/G 


AGR:449: 


T/T 


AT1:1271: 


A/C 


ACR:5349: 


A/A . 


AGR:449: 


T/T 


ATI: 1271: 


A/C 


ACE:2193: 


A/A 


AGR:620: 


C/T 


ATI: 1271: 


A/A 



Polymorphism Patterns Correlated With Predisposition to Stroke 
The following table lists a set of polymorphism patterns that have been 
found to correlate with predisposition to stroke: 

Predisposition to Stroke 



Position 1 


Genotype 1 


Position 2 


Genotype 2 


Position 3 


Genotype 3 


ACE:2I93: 


A/A 


AGR:395: 


A/T 






AGR:1007: 


A/G 


AGR:1072: 


G/G 


ATI: 1167: 


A/A 


AGR:395: 


A/T 


AGR:1072: 


G/G 


AGR:1218: 


A/G 



Polymorphism Patterns in ^Adrenergic Receptor Genes 
The following table lists a set of polymorphism patterns that have been 
found in P-adrenergic receptor genes: 

Positions carrying genetic variation in the Beta adrenergic receptors 1 and 
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BIP 


BIR 


B2R 


B2P 


2238 C/A 


231 A/G 


839 A/G 


932 


2440 


758 CAT 
1037 


872 C/G 


934 A/G 


2493 


1251 C/G 


1045 A/G 


987 C/G 
1006 A/G 


2502 


1403 A/G 


1284 C/T 


11 20 C/G 


2577 C/T 


1528 C/A 


1316A/C 


1221 C/T 


2585 




1846 C/G 


1541 C/T 


2693 




1891 


1568 C/T 


2724 




2032 A/G 




2757 A/G 




2068 no insert, G/G, 
C/G 

2070 no insert /C 





BIP: Beta adrenergic receptor 1, regulatory promoter region. 
BIR: Beta adrenergic receptor 1, coding region. 
B2P: Beta adrenergic receptor 2, regulatory promoter region. 
15 B2R: Beta adrenergic receptor 2, coding region. 

A number of different polymorphisms have been identified in the type 2 P- 
adrenoceptor. All of these differed from the wild type sequence by a single base change. 
Four of the polymorphisms alter the amino acid sequence of the receptor protein (Hall, 
Thorax, 1996, 51:351-353). The amino acid sequence modifications are described in 
2 0 greater detail below: 

Argl6^Gly: The Glyl6 variant undergoes an enhanced agonist-promoted 
down regulation as compared to wild type but the coupling to adenylyl cyclase and agonist 
binding are maintained (Liggett, Am. J. Respir. Crit, Care Med., 1997, 156:S 156-S162). 

Gln27--Glu: The Glu27 variant displays very little agonist-promoted 
25 downregulalion and the coupling to adenylyl cyclase and agonist binding are maintained 

Val34-Met: Met34 is very rare. No altering of receptor function has been 

found (id.). 
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Thrl64-Ile: Uncommon (about 5%). The Ilel64 variant shows depressed 
coupling to adenylyl cyclase and decreased affinities for agonists with hydroxy! groups on 
their P-carbons, such as epinephrine, norepinephrine, and isoproterenol compared to wild 

type (id,). 

The polymorphism at nucleic acid 523 (CGG-- AGG) might be linked with 
one of the other functional polymorphisms (id.). 

There are no differences in frequency of these polymorphisms between the 
normal group and those with asthma but they have been correlated to differences in 
response to treatment with agonists in asthma, e.g., the Glyl6 variant undergoes an 
enhanced agonist-promoted downregulation compared to wild type (id,). 



Polymorphism Patterns in Endothelin Receptor Type A Gene 
The following table lists a set of polymorphism patterns that have been 

found in the coding region of the endothelin receptor type A gene: 



Position 969 


Position lOOS 


Position 1146 


Position 2485 


C/C 


A/A 


A/A 


T/T 


C/T 


A/G 


A/G 


T/C 


T/T 


G/G 







Isolated Polymorphic Nucleic Acids. Vectors. Probes & Primers and Arrays 
Vectors for Expression of Polymorphic Variants 
The present invention provides isolated nucleic acids comprising the 
polymorphic positions described above for the human genes encoding ACE, ATI, AGT, 
renin, aldosterone synthase, type-2 angiotensin II receptor, endothelin receptor, and P- 
adrenoceptor; vectors comprising the nucleic acids; and transformed host cells comprising 
the vectors. The invention also provides probes which are useful for detecting these 
polymorphisms. 

The nucleic acids encoding a gene comprising a polymorphism that is 
useful for determining cardiovascular status of an individual is particularly valuable for 
screening, whether by direct screening of the nucleic acid with the polymorphism, or by 
screening the polypeptide expressed by that nucleic acid. 
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In practicing the present invention, many conventional techniques in 
molecular biology, microbiology, and recombinant DNA, are used. Such techniques are 
well known and are explained fully in, for example, Sambrook et al, Molecular Cloning: 
A Laboratory Manual, Second Edition. 1989 (Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, New York); DNA Cloning: A Practical Approach, Volumes I and II, 1985 
(D.N. Glover ed.); Oligonucleotide Synthesis, 1984, (M.L. Gait ed.); Nucleic Acid 
Hybridization, 1985, (Hames and Higgins); Ausubel et al, Current Protocols in 
Molecular Biology, 1997 (John Wiley and Sons); and Methods in Enzymolog)> Vol. 154 
and Vol. 155 (Wu and Grossman, and Wu, eds., respectively). 

Insertion of nucleic acids (typically DNAs) comprising the sequences of the 
present invention into a vector is easily accomplished when the termini of both the DNAs 
and the vector comprise compatible restriction sites. If this cannot be done, it may be 
necessary to modify the termini of the DNAs and/or vector by digesting back single- 
stranded DNA overhangs generated by restriction endonuclease cleavage to produce blunt 
ends, or to achieve the same result by filling in the single-stranded termini with an 
appropriate DNA polymerase. 

Alternatively, any site desired may be produced, e.g., by ligating nucleotide 
sequences (linkers) onto the termini. Such linkers may comprise specific oligonucleotide 
sequences that define desired restriction sites. Restriction sites can also be generated by 
the use of the polymerase chain reaction (PGR). See, e.g., Saiki et aL, Science, 1988, 
239:48. The cleaved vector and the DNA fragments may also be modified if required by 
homopolymeric tailing. 

The nucleic acids may be isolated directly from cells or may be chemically 
synthesized using known methods. Alternatively, the polymerase chain reaction (PGR) 
method can be used to produce the nucleic acids of the invention, using either chemically 
synthesized strands or genomic material as templates. Primers used for PGR can be 
synthesized using the sequence information provided herein and can further be designed to 
introduce appropriate new restriction sites, if desirable, to facilitate incorporation into a 
given vector for recombinant expression. 

The nucleic acids of the present invention may be flanked by native gene 
sequences, or may be associated with heterologous sequences, including promoters, 
enhancers, response elements, signal sequences, polyadenylation sequences, introns, 5'- 
and 3'- noncoding regions, and the like. 
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The invention also provides nucleic acid vectors comprising the disclosed 
genes or derivatives or fragments thereof. A large number of vectors, including plasmid 
and fungal vectors, have been described for replication and/or expression in a variety of 
eukaryotic and prokaryotic hosts, and may be used for gene therapy as well as for simple 
cloning or protein expression. Non-limiting examples of suitable vectors include without 
limitation pUC plasmids, pET plasmids (Novagen, Inc., Madison, WI), or pRSET or pREP 
(Invitrogen, San Diego, CA), and many appropriate host cells, using methods disclosed or 
cited herein or otherwise known to those skilled in the relevant art. The particular choice 
of vector/host is not critical to the practice of the invention. 

Suitable host cells may be transformed/transfected/infected as appropriate 
by any suitable method including electroporation, CaClj mediated DNA uptake, calcium 
phosphate precipitation, fungal or viral infection, lipofection, microinjection, 
microprojectile, or other established methods. Appropriate host cells included bacteria, 
archebacteria, fungi, especially yeast, and plant and animal cells, especially mammalian 
cells. A large number of transcription initiation and termination regulatory regions have 
been isolated and shown to be effective in the transcription and translation of heterologous 
proteins in the various hosts. Examples of these regions, methods of isolation, manner of 
manipulation, etc, are known in the art. Under appropriate expression conditions, host 
cells can be used as a source of recombinantly produced ACE-, AGT-, or ATI -derived 
peptides and polypeptides. 

Nucleic acids encoding ACE-, AGT-, or ATI -derived gene sequences may 
also be introduced into cells by recombination events. For example, such a sequence can 
be introduced into a cell and thereby effect homologous recombination at the site of an 
endogenous gene or a sequence with substantial identity to the gene. Other 
recombination-based methods such as nonhomologous recombinations or deletion of 
endogenous genes by homologous recombination may also be used. 

Olieonucleotides 

The nucleic acids of the present invention find use as probes for the 
detection of genetic polymorphisms, as primers for the expression of polymorphisms, or in 
molecular library arrays for high throughput screening. 

Probes in accordance with the present invention comprise without 
limitation isolated nucleic acids of about 10 - 100 bp, preferably 15-75 bp and most 
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preferably 17-25 bp in length, which hybridize at high stringency to one or more of the 
gene-derived polymorphic sequences disclosed herein or to a sequence immediately 
adjacent to a polymorphic position. Furthermore, in some embodiments a full-length gene 
sequence may be used as a probe. In one series of embodiments, the probes span the 
polymorphic positions in the genes disclosed above. In another series of embodiments, the 
probes correspond to sequences immediately adjacent to the polymorphic positions. 

The oligonucleotide nucleic acids may also be modified by many means 
known in the art. Non-limiting examples of such modifications include methylation, 
"caps", substitution of one or more of the naturally occurring nucleotides with an analog, 
intemucleotide modifications such as, for example, those with uncharged linkages {e.g., 
methyl phosphonates, phosphotriesters, phosphoroamidates, carbamates, etc.) and with 
charged linkages {e.g., phosphorothioates, phosphorodithioates, etc.). Nucleic acids may 
contain one or more additional covalently linked moieties, such as, for example, proteins 
{e.g., nucleases, toxins, antibodies, signal peptides, poly-L-lysine, etc.)y intercalators (e.g., 
acridine, psoralen, etc.), chelators {e.g., metals, radioactive metals, iron, oxidative metals, 
etc.), and alkylators. PNAs are also included. The nucleic acid may be derivatized by 
formation of a methyl or ethyl phosphotriester or an alkyl phosphoramidate linkage. 
Furthermore, the nucleic acid sequences of the present invention may also be modified 
with a label capable of providing a detectable signal, either directly or indirectly. 
Exemplary labels include radioisotopes, fluorescent molecules, biotin, and the like. 

PCR amplification of gene segments that contain a polymorphism provides 
a powerful tool for detecting the polymorphism. The oligonucleotides of the invention can 
also be used as PCR primers to amplify gene segments containing a polymorphism of 
interest. The amplified segment can be evaluated for the presence or absence of a 
polymorphism by restriction endonuclease activity, SSCP, or by direct sequencing. In 
another embodiment, the primer is specific for a polymorphic sequence on the gene. If the 
polymorphism is present, the primer can hybridize and DNA will be produced by PCR. 
However, if the polymorphism is absent, the primer will not hybridize, and no DNA will 
be produced. Thus, PCR can be used to directly evaluate whether a polymorphism is 
present or absent. 

Molecular library arrays of oligonucleotides (including oligonucleotides 
with modifications as described above) are another powerful tool for rapidly assessing 
whether one or more polymorphisms are present in a ACE, ATI, ACT, renin, aldosterone 
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synthase, type-2 angiotensin II receptor, endothelin receptor^ or ^adrenoceptor gene, 
preferably in combination with other genes. Molecular library arrays are disclosed in US 
Patents No. 5,677,195, No. 5,599,695, No. 5,545,531, and No, 5,510,270. 



Polymorphic Polypeptides and Polymorphism-Specific Antibodies 
The present invention encompasses isolated peptides and polypeptides 
encoded by all or a portion of a gene encoding a polypeptide selected from the group 
consisting of ACE, ATI, ACT, renin, aldosterone synthase, type-2 angiotensin II receptor, 
endothelin receptor, and p-adrenoceptor, comprising polymorphic positions disclosed 
above. In one preferred embodiment, the peptides and polypeptides are useful screening 
targets to identify cardiovascular drugs. In another preferred embodiment, the peptides 
and polypeptides are capable of eliciting antibodies in a suitable host animal that react 
specifically with a polypeptide comprising the polymorphic position and distinguish it 
from other polypeptides having a different amino acid sequence at that position. 

Polypeptides according to the invention are preferably at least five or more 
residues in length, preferably at least fifteen residues. Methods for obtaining these 
polypeptides are described below. Many conventional techniques in protein biochemistry 
and immunology are used. Such techniques are well known and are explained in 
Immunochemical Methods in Cell and Molecular Biology^ 1987 (Mayer and Waler, eds; 
Academic Press, London); Scopes, Protein Purification: Principles and Practice^ Second 
Edition 1987 (Springer- Verlag, N.Y.) and Handbook of Experimental Immunology y 
Volumes I-IV 1986 (Weir and Blackwell eds.). 

Nucleic acids comprising protein-coding sequences can be used to direct 
the recombinant expression of ACE, ATI, ACT, renin, aldosterone synthase, type-2 
angiotensin II receptor, endothelin receptor, or P-adrenoceptor-derived polypeptides in 
intact cells or in cell-free translation systems. The known genetic code, tailored if desired 
for more efficient expression in a given host organism, can be used to synthesize 
oligonucleotides encoding the desired amino acid sequences. The polypeptides may be 
isolated from human cells, or from heterologous organisms or cells (including, but not 
limited to, bacteria, fungi, insect, plant, and mammalian cells) into which an appropriate 
protein-coding sequence has been introduced and expressed. Furthermore, the 
polypeptides may be part of recombinant fusion proteins. 
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Peptides and polypeptides may be chemically synthesized by commercially 
available automated procedures, including, without limitation, exclusive solid phase 
synthesis, partial solid phase methods, fragment condensation or classical solution 
synthesis. The polypeptides are preferably prepared by solid phase peptide synthesis as 
described by Merrifield, J. Am. Chem. Soc, 1963, 85:2149. 

Methods for polypeptide purification are well-known in the art, including, 
without limitation, preparative disc-gel electrophoresis, isoelectric focusing, HPLC, 
reversed-phase HPLC, gel filtration, ion exchange and partition chromatography, and 
countercurrent distribution. For some purposes, it is preferable to produce the polypeptide 
in a recombinant system in which the protein contains an additional sequence tag that 
facilitates purification, such as, but not limited to, a polyhistidine sequence. The 
polypeptide can then be purified from a crude lysate of the host cell by chromatography on 
an appropriate solid-phase matrix. Alternatively, antibodies produced against ACE, ATI, 
ACT, renin, aldosterone synthase, type-2 angiotensin II receptor, endothelin receptor, or 
P-adrenoceptor or against peptides derived therefrom, can be used as purification reagents. 
Other purification methods are possible. 

The present invention also encompasses derivatives and homologues of the 
polypeptides. For some purposes, nucleic acid sequences encoding the peptides may be 
altered by substitutions, additions, or deletions that provide for functionally equivalent 
molecules, i.e., function-conservative variants. For example, one or more amino acid 
residues within the sequence can be substituted by another amino acid of similar 
properties, such as, for example, positively charged amino acids (arginine, lysine, and 
histidine); negatively charged amino acids (aspartate and glutamate); polar neutral amino 
acids; and non-polar amino acids. 

The isolated polypeptides may be modified by, for example, 
phosphorylation, sulfation, acylation, or other protein modifications. They may also be 
modified with a label capable of providing a detectable signal, either directly or indirectly, 
including, but not limited to, radioisotopes and fluorescent compounds. 

The present invention also encompasses antibodies that specifically 
recognize the polymorphic positions of the invention and distinguish a peptide or 
polypeptide containing a particular polymorphism from one that contains a different 
sequence at that position. Such polymorphic position-specific antibodies according to the 
present invention include polyclonal and monoclonal antibodies. The antibodies may be 
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elicited in an animal host by immunization with ACE, ATI, AGT, renin, aldosterone 
synthase, type-2 angiotensin II receptor, endothelin receptor, or P-adrenoceptor-derived 
immunogenic components or may be formed by in vitro immunization of immune cells. 
The immunogenic components used to elicit the antibodies may be isolated from human 
cells or produced in recombinant systems. The antibodies may also be produced in 
recombinant systems programmed with appropriate antibody-encoding DNA. 
Alternatively, the antibodies may be constructed by biochemical reconstitution of purified 
heavy and light chains. The antibodies include hybrid antibodies (/.e, containing two sets 
of heavy chain/light chain combinations, each of which recognizes a different antigen), 
chimeric antibodies (/,e., in which either the heavy chains, light chains, or both, are fusion 
proteins), and univalent antibodies (/.e,, comprised of a heavy chain/light chain complex 
bound to the constant region of a second heavy chain). Also included are Fab fragments, 
including Fab' and F(ab)2 fragments of antibodies. Methods for the production of all of the 
above types of antibodies and derivatives are well-known in the art and are discussed in 
more detail below. For example, techniques for producing and processing polyclonal 
antisera are disclosed in Mayer and Walker, Immunochemical Methods in Cell and 
Molecular Biology, 1987 (Academic Press, London). The general methodology for 
making monoclonal antibodies by hybridomas is well known. Immortal antibody- 
producing cell lines can be created by cell fusion, and also by other techniques such as 
direct transformation of B lymphocytes with oncogenic DNA, or transfection with 
Epstein-Barr virus. See, e.g., Schreier a/., Hybridoma Techniques^ 1980; U.S. Patent 
Nos. 4,341,761; 4,399,121; 4,427,783; 4,444,887; 4,466,917; 4,472,500; 4,491.632; and 
4,493,890. Panels of monoclonal antibodies produced against ACE, AGT, or ATI -derived 
epitopes can be screened for various properties; i.e. for isotype, epitope affinity, etc. 

The antibodies of this invention can be purified by standard methods, 
including but not limited to preparative disc-gel electrophoresis, isoelectric focusing, 
HPLC, reversed-phase HPLC, gel filtration, ion exchange and partition chromatography, 
and countercurrent distribution. Purification methods for antibodies are disclosed, e.g., in 
The Art of Antibody Purification, 1989 (Amicon Division, W.R. Grace & Co.) General 
protein purification methods are described in Protein Purification: Principles and 
Practice, R.K. Scopes, Ed., 1987 (Springer- Veriag, New York, NY). 

Methods for determining the immunogenic capability of the disclosed 
sequences and the characteristics of the resulting sequence-specific antibodies and immune 
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cells are well-known in the art. For example, antibodies elicited in response to a peptide 
comprising a particular polymorphic sequence can be tested for their ability to specifically 
recognize that polymorphic sequence, Le., to bind differentially to a peptide or polypeptide 
comprising the polymorphic sequence and thus distinguish it from a similar peptide or 
polypeptide containing a different sequence at the same position. 

Diagnostic Methods and Kits 
The present invention provides kits for the determination of the sequence at 
a polymorphic position or positions within the ACE, ATI, ACT, renin, aldosterone 
synthase, type-2 angiotensin II receptor, endothelin receptor, or ^-adrenoceptor gene in 
an individual, in combination with determination of the sequence at polymorphism 
positions of other genes. The kits comprise a means for determining the sequence at the 
polymorphic positions, and may optionally include data for analysis of polymorphic 
patterns. The means for sequence determination may comprise suitable nucleic acid-based 
and immunological reagents (see below). Preferably, the kits also comprise suitable 
buffers, control reagents where appropriate, and directions for determining the sequence at 
a polymorphic position. The kits may also comprise data for correlation of particular 
polymorphic patterns with desirable treatment regimens or other indicators. 

Nucleic Acid-'Based Diagnostic Methods and Kits 
The invention provides nucleic acid-based methods for detecting 
polymorphic patterns in a biological sample. The sequence at particular polymorphic 
positions in the genes is determined using any suitable means known in the art, including 
without limitation hybridization with polymorphism-specific probes and direct 
sequencing. 

The present invention also provides kits suitable for nucleic acid-based 
diagnostic applications. In one embodiment, diagnostic kits include the following 
components: 

(i) Probe DNA: The probe DNA may be pre-labelled; alternatively, 
the probe DNA may be unlabelled and the ingredients for labelling may be included in the 
kit in separate containers; and 
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(ii) Hybridization reagents: The kit may also contain other suitably 
packaged reagents and materials needed for the particular hybridization protocol, including 
solid-phase matrices, if applicable, and standards. 

In another embodiment, diagnostic kits include: 

(i) Sequence determination primers: Sequencing primers may be 
pre-labelled or may contain an affinity purification or attachment moiety; and 

(ii) Sequence determination reagents: The kit may also contain 
other suitably packaged reagents and materials needed for the particular sequencing 
protocol. In one preferred embodiment, the kit comprises a panel of sequencing primers, 
whose sequences correspond to sequences adjacent to the polymorphic positions. 

Antibody-Based Diagnostic Methods and Kits 
The invention also provides antibody-based methods for detecting 
polymorphic patterns in a biological sample. The methods comprise the steps of: (i) 
contacting a sample with one or more antibody preparations, wherein each of the antibody 
preparations is specific for a particular polymorphic form of the gene under conditions in 
which a stable antigen-antibody complex can form between the antibody and antigenic 
components in the sample; and (ii) detecting any antigen-antibody complex formed in step 
(i) using any suitable means known in the art, wherein the detection of a complex indicates 
the presence of the particular polymorphic form in the sample. 

Typically, immunoassays use either a labelled antibody or a labelled 
antigenic component (e.g., that competes with the antigen in the sample for binding to the 
antibody). Suitable labels include without limitation enzyme-based, fluorescent, 
chemiluminescent, radioactive, or dye molecules. Assays that amplify the signals from the 
probe are also known, such as, for example, those that utilize biotin and avidin, and 
enzyme-labelled immunoassays, such as ELISA assays. 

The present invention also provides kits suitable for antibody-based 
diagnostic applications. Diagnostic kits typically include one or more of the following 
components: 

(i) Polymorphism-specific antibodies: The antibodies may be pre-labelled; 
alternatively, the antibody may be unlabelled and the ingredients for labelling may be 
included in the kit in separate containers, or a secondary, labelled antibody is provided; 
and 
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(ii) Reaction components: The kit may also contain other suitably 
packaged reagents and materials needed for the particular immunoassay protocol, 
including solid-phase matrices, if applicable, and standards. 

The kits referred to above may include instructions for conducting the test. 
Furthermore, in preferred embodiments, the diagnostic kits are adaptable to high- 
throughput and/or automated operation. 



Drug Targets and Screening Methods 
According to the present invention, nucleotide sequences derived from the 
gene encoding a polymorphic form of ACE, ATI, AGT, renin, aldosterone synthase, type- 
2 angiotensin II receptor, endothelin receptor, or p-adrenoceptor, and peptide sequences 
derived from that polymorphic for, are useful targets to identify cardiovascular drugs, /.e., 
compounds that are effective in treating one or more clinical symptoms of cardiovascular 
disease. Drug targets include without limitation (i) isolated nucleic acids derived from the 
gene encoding ACE, ATI, AGT, renin, aldosterone synthase, type-2 angiotensin II 
receptor, endothelin receptor, or P-adrenoceptor and (ii) isolated peptides and polypeptides 
derived from ACE, ATI, AGT, renin, aldosterone synthase, type-2 angiotensin II receptor, 
endothelin receptor, or P-adrenoceptor polypeptides, each of which comprises one or more 
polymorphic positions. 

In vitro screening methods 
In one series of embodiments, an isolated nucleic acid comprising one or 
more polymorphic positions is tested in vitro for its ability to bind test compounds in a 
sequence-specific manner. The methods comprise: 

(i) providing a first nucleic acid containing a particular sequence at a 
polymorphic position and a second nucleic acid whose sequence is identical to that of the 
first nucleic acid except for a different sequence at the same polymorphic position; 

(ii) contacting the nucleic acids with a multiplicity of test compounds under 
conditions appropriate for binding; and 

(iii) identifying those compounds that bind selectively to either the first or 
second nucleic acid sequence. 

Selective binding as used herein refers to any measurable difference in any 
parameter of binding, such as, e.g., binding affinity, binding capacity, etc. 
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In another series of embodiments, an isolated peptide or polypeptide 
comprising one or more polymorphic positions is tested in vitro for its ability to bind test 
compounds in a sequence-specific manner. The screening methods involve: 

(i) providing a first peptide or polypeptide containing a particular sequence 
at a polymorphic position and a second peptide or polypeptide whose sequence is identical 
to the first peptide or polypeptide except for a different sequence at the same polymorphic 
position; 

(ii) contacting the polypeptides with a multiplicity of test compounds under 
conditions appropriate for binding; and 

(iii) identifying those compounds that bind selectively to one of the nucleic 
acid sequences. 

In preferred embodiments, high-throughput screening protocols are used to 
survey a large number of test compounds for their ability to bind the genes or peptides 
disclosed above in a sequence-specific manner. 

Test compounds are screened from large libraries of synthetic or natural 
compounds. Numerous means are currently used for random and directed synthesis of 
saccharide, peptide, and nucleic acid based compounds. Synthetic compound libraries are 
commercially available from Maybridge Chemical Co. (Trevillet, Cornwall, UK), 
Comgenex (Princeton, NJ), Brandon Associates (Merrimack, NH), and Microsource (New 
Milford, CT). A rare chemical library is available from Aldrich (Milwaukee, WI). 
Alternatively, libraries of natural compounds in the form of bacterial, fungal, plant and 
animal extracts are available from e.g. Pan Laboratories (Bothell, WA) or MycoSearch 
(NC), or are readily producible. Additionally, natural and synthetically produced libraries 
and compounds are readily modified through conventional chemical, physical, and 
biochemical means. 

In vivo screening methods 
Intact cells or whole animals expressing polymorphic variants of a gene 
encoding ACE, ATI, AGT, renin, aldosterone synthase, type-2 angiotensin II receptor, 
endothelin receptor, or P-adrenoceptor can be used in screening methods to identify 
candidate cardiovascular drugs. 
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In one series of embodiments, a permanent cell line is established from an 
individual exhibiting a particular polymorphic pattern. Alternatively, cells (including 
without limitation mammalian, insect, yeast, or bacterial cells) are programmed to express 
a gene comprising one or more polymorphic sequences by introduction of appropriate 
5 DNA. Identification of candidate compounds can be achieved using any suitable assay, 
including without limitation (i) assays that measure selective binding of test compounds to 
particular polymorphic variants of the gene; (ii) assays that measure the ability of a test 
compound to modify {i.e., inhibit or enhance) a measurable activity or function of the 
gene; and (iii) assays that measure the ability of a compound to modify (i.e., inhibit or 
10 enhance) the transcriptional activity of sequences derived from the promoter (/.e., 
regulatory) regions the gene. 

In another series of embodiments, transgenic animals are created in which 

(i) a human ACE, ATI, ACT, renin, aldosterone synthase, type-2 angiotensin II receptor, 
endothelin receptor^ or ^-adrenoceptor gene having different sequences at particular 

15 polymorphic positions are stably inserted into the genome of the transgenic animal; and/or 

(ii) the endogenous genes are inactivated and replaced with human genes having different 
sequences at particular polymorphic positions. See, e.g., Coffman, Semin. Nephrol., 1997, 
17:404; Esther et ai. Lab. Invest., 1996, 74:953; Murakami et al. Blood Press. SuppL, 
1996, 2:36. Such animals can be treated with candidate compounds and monitored for one 

20 or more clinical markers of cardiovascular status. 

Furthermore, populations that are not amenable to an established treatment 
for a cardiovascular disease or disorder can be selected for testing of alternative 
treatments. Moreover, treatments that are not as effective in the general population, but 
that are highly effective in the selected population, may be identified that otherwise would 
25 be overlooked. This is an especially powerful advantage of the present invention, since it 
eliminates some of the randomness associated with clinical trials. 

The following are intended as non-limiting examples of the invention. 

Example 1: Methods for Identification of Polymorphic Positions in Human Genes 
30 Encoding ACE, ACT, and ATI 

The following studies were performed to identify polymorphic residues 
within the genes encoding human ACE, ACT, and ATI. 
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DNA samples were obtained from 277 individuals. The individuals were 
Caucasian males bom in Uppsala, Sweden between 1920 and 1924. Individuals were 
selected for the test population based on their medical history, i.e., they were either (i) 
healthy, with no signs of cardiovascular disease (100); or (ii) had suffered one of acute 
myocardial infarction (68), silent myocardial infarction (34), stroke (18), stroke and acute 
myocardial infarction (19), or high blood pressure at age 50 (39). DNA samples were 
obtained from each individual. 

DNA sequence analysis was carried out by: (i) amplifying short fragments 
of each of the ACE, ACT, and ATI genes using polymerase chain reaction (PCR) and (ii) 
sequencing the amplified fragments. The sequences obtained from each individual were 
then compared with known ACE, ACT, and ATI genomic sequences (see Table 1). 

(i) Amplification: PCR reactions employed the primers shown in Table 2 

below. 
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Sequence 


5 ' -TGCGTGCTTCAGAAGTCC-3 ' 


5 '-CCAGGG AGGTGAAGAAATC-3 ' 


5 '-AGCCAGGCAGTAATGACCT-3 ' 


5'-GCCCACTGTTCCCTTATG-3' 


5 '-TGCCCTGACTGACAGAGC-3' 


5'-GCCCTGGTGTGCCTGT-3' 


5 '-TGCCTGGATATGTGTTGC-S ' 


5'-TGCCTGGATATGTGTTGC-3' 


5 •-GCCCTCGCCTCTCACT-3 ' 


5 '-TCCCCTCTCCCTGTACCT-3 ' 


5'-GTGCTGGGGTAGGGTAGA-3' 


5 '-TCCCCCTG ACCTGGCT-3 ' 


5 '-GGGGCACCGTG ATGTT-3 ' 


5'-GGGGCACCGTGATGTT-3' 


5'-GCCAGAGCCTTTGGTTT-3' 


5'-TGGAAGAGCCGACTTACAG-3' 


5'-TCCCAGAGGCAAAGAGG-3' 


5'-GTTTCTACTGCGGCTTCAT-3' 


5'-GTTTCTACTGCGGCTTCAT-3' 
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Sequence 


5 '-TCCTGGAAGAGGGAGTTTC-3 ' 


5 '-GCAGG ATG AGAGCAACAAC-3 ' 


5'-CTGGAGACCACTCCCATCCTTTCT-3' 


5'-GATGTGGCCATCACATTCGTCAGAT-3' 


5'-CTTCCGTGGGACTCATGT-3' 


5--TGCACCGTGAGGCTCTA-3' 


5'-GCCCAATAGGAGGAAGCA-3' 


5 '-CCCACCCCATCTCC AAGAA-3 ' 
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5 '-TCCCTGATGGGCTGCTCTC-3 ' 


5'-CAAGGCCCTCAACCAACTC-3' 


5'-TTCCCACAAAAGCTCCAGTG-3' 


5 '-GGCTCAA AATGGC AAGTGTT-3 ' 


5'-GGGCCATGTCCTTCTGACTC-3' 
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5 '-GGCCATGTTGAGCTACTTCAA-3 ' 


5'-CCTCCAGCCTTGGGTCTTAA-3' 


5 '-TTCCC ATCCC AGTCTCTGGT-3 ' 
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5'-TCCAGCCGCTCCCCATC-3' 
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Where indicated, the primers were modified in one of the following ways: 
(i) a biotin molecule was conjugated to the 5' terminus of the indicated sequence (B); (ii) a 
sequence of nucleotides derived from M13, 5'-CAGGAAACAGCTATGACT-3' (SEQ ID 
NO: 120), was added at the 5' terminus of the indicated sequence (MT); or (iii) the 
sequence 5'-AGTCACGACGTTGTAAAACGACGGCCAGT-3^ (SEQ ID NO: 121) was 
added at the 5' terminus of the indicated sequence (T = Tail). Nucleotides were numbered 
according to the GenBank sequences listed in Table 1 where indicated. When the 
sequences involved were not publicly available, the numbering was as in the following 
examples: The designation "i-4: 1-200" indicates that the primer sequence is located 
within the sequence extending 200 bp upstream of, and including, the nucleotide 
immediately upstream of the first coding nucleotide of exon 4. Similarly, the designation 
"i+4: 1-200" indicates that the primer sequence is located within the sequence extending 
from the nucleotide that is located immediately downstream of the last coding nucleotide 
of exon 4 downstream for 200 bp. In each case, the specific location of the primer 
sequence is indicated in Table 2 in the column marked "Nucleotides". 

The reaction components used for PGR are described in Table 3, and the 
reaction conditions for PGR are described Table 4, below. 
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All of the PCR products (except fragments ACEDI, ATI-spec. 1 and ATI- 
spec. 2) were subjected to solid phase sequencing according to the protocol commercially 
available from Pharmacia Biotech. The sequencing reactions are performed with a 
sequencing primer having a complementary sequence to the "Tail" sequence previously 
described in Table 2. The nucleotide sequence of the sequencing primer was 5'- 
CGACGTTGTAAAACGACGGCCAGT-3' (SEQ ID NO: 122), and the primer was 
fluorescently labeled with a Cy-5-molecule on the 5 '-nucleotide. The positions carrying a 
genetic variation were identified by determination of the nucleotide sequence by the use of 
the ALFexpress"^"^ system commercially available from Pharmacia Biotech. 

The detection of the fragment ACEDI was performed by analyzing the sizes 
of the amplified fragments by gel eleclrophoresis, where the presence of a shorter PCR 
product (192 base pairs) indicated the D-allele and a longer PCR product (479 base pairs) 
indicated the I-allele. The presence of both bands indicated a heterozygote for the two 
alleles. The detection of the allele-specific reaction of position ATl-1271 was performed 
by separately running two parallel PCR reactions on the same sample and comparing the 
sizes of the amplified fragments. A PCR product of 501 base pairs should always be 
present as a control in both parallel runs, whereas the presence of a PCR product of 378 
base pairs in the reaction designated ATI -spec. 1 indicated the presence of an A in this 
position. The presence of a PCR product of 378 base pairs in the reaction designated ATI - 
spec. 2 indicated a C in this position. If the shorter PCR product was present in both 
reactions, the individual is a heterozygote for A and C. 

Results 

The analysis described above resulted in the identification of polymorphic 
positions within the regulatory and coding/intron segments of the human genes encoding 
ACE, AGT, and ATI . The polymorphic positions, the variant nucleotides found at each of 
the positions, and the PCR fragment in which the polymorphism was identified are shown 
in Table 6 below. Also shown are the frequencies of each genotype in a population of 90 
individuals, expressed as the percent of the study population having that genotype. 
Polymorphisms that resulted in alternate amino acids in ACE, AGT, or ATI are also 
indicated. As used herein below, the designations "AGR" , "ACR", and "ATR" refer to 
the regulatory regions of the human AGT, ACE, and ATI genes, respectively; and the 
designations "AGT", "ACE", and "ATI", refer to the coding regions of the AGT, ACE. 
and ATI genes. 
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A subset of these polymorphic positions were further analyzed in an 
additional 187 individuals. Table 7 shows the polymorphic positions, the sequence at 
these positions, and the genotype frequencies for each position in a population of 277 as 
described in Example 1 above. 



Table 7 - Polymorphic Positions and Frequencies 
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Example 2: Correlation of Polymorphism Patterns with Cardiovascular Disease 

The polymorphic positions identified as in Example 1 were correlated with 
the following markers of cardiovascular status present in the study population: healthy 
(100 individuals evaluated); myocardial infarction (MI) (120 individuals); stroke (37 
individuals); and high blood pressure (BP) (39 individuals). Polymorphic patterns, /.e., 
combinations of sequences at particular polymorphic positions, that show a statistically 
significant correlation with one or more of these markers are shown below. 
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13.5 


7.7 




Sumni 


ary of the three previous polvmorphic patterns: 






Healthy 


MI 


Stroke 


High BP 


Total (n) 




# of events 


3 


14 


6 


6 


27 




% within group 


3 


11.7 


16.2 


15.4 




Aul c 


520 C, ATI 678 A, ATI 1 167 A, AGR 395 A/T 






Healthy 


MI 


Stroke 


High BP 


Total (n) 




# of events 


1 


2 


2 


1 


5 




% within group 


1 


1.7 


5.4 


2.6 




AGT( 


520 C/T, ATI 678 C/T; AT 1 1 1 67 A, AGR 395 1 








Healthy 


MI 


Stroke 


High BP 


Total (n) 




# of events 


3 


15 


4 


4 


24 




% within group 


3 


12.5 


10.8 


10.3 




Sumrr 


mry of the two previous polymorphic patterns: 




Healthy 


MI 


Stroke 


High BP 


Total (n) 




# of events 


4 


17 


6 


5 


29 




% within group 


4 


14.2 


16.2 


12.9 




ACE 2193 A. AGR 1218 A. AGT 803 A 






Healthy 


MI 


Stroke 


High BP 


Total (n) 




# of events 


2 


5 


1 


3 


11 




% within group 


2 


4.2 


2.7 


7.7 
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ACE 2193 A. AGT 620 C/T 





Healthy 


MI 


Stroke 


High BP 


Total (n) 


# of events 


I 


11 


3 


2 


16 


% within group 


1 


9.2 


.8.1 


5.1 




2328 G, AGT 620 C 


rr 




Healthy 


MI 


Stroke 


High BP 


Total (n) 


# of events 


1 


11 


3 


2 


16 


% within group 


1 


9.2 


8.1 . 


5.1 





ACE 3387 T. AGR 1 204 A/C 





Healthy 


MI 


Stroke 


High BP 


Total (n) 


# of events 


0 


10 


3 


3 


15 


% within group 


0 


8.3 


8.1 


7.7 





Example 3: Correlation Between a Specific Polymorphism Pattern 
and Treatment Response 

The following study was undertaken to define polymorphic patterns in the 
human ACE, AGT, and/or ATI genes that predict the efficacy of treatments for 
cardiovascular disease. 

Two groups of hypertensive patients were studied, 41 in the first group and 
20 in the second group. The groups were analyzed independently and in combination. 

The patients in this population were each treated with one of the following 
five ACE inhibitors: Captopril, Trandolapril, Lisinopril, Fosinopril, or Enalapril. The 
effect of the drugs on mean arterial blood pressure was quantified. Mean arterial blood 
pressure was defined as 2/3 of the diastolic blood pressure + 1/3 of systolic blood pressure. 
The individuals were also categorized as "high responders," i.e., those exhibiting a 
decrease of more than 1 6 mm Hg during treatment with an ACE inhibitor drug, and "low 
responders," Le,, those not exhibiting a decrease of more than 16 mm Hg. 

One particular polymorphic pattern, ACE 2193 A/G, AGR 1072 G/G, ATI 
1 167 A/A, which was present in 51% of the first study population, discriminated between 
high responders and low responders. In the second group of 20 patients, the pattern was 
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less prevalent (25%), but the correlation with lowered blood pressure remained statistically 
significant. Individuals having this polymorphic pattern (designated "1" below) 
experienced a larger decrease in blood pressure than those lacking this polymorphic 
pattern (designated "0" below). 



Table 9 - Response to Treatment Correlates with Polymorphic Pattern 



Polymorphic Pattern 


Observations 


Mean (mm Hg) 
Change in B.P. 


S.D. 


0 


36 


-11,4 


8.6 


1 


25 


-18.1 


9.7 



Furthermore, the distribution of high responders and low responders (as 
defined above) was as follows: 

Table 10 - Responder Status Correlates with Polymorphic Pattern 



Polymorphic Pattern 


Low responder % 


High responder % 


0 


80.1 


19.4 1 


1 


24.0 


76.0 1 



Taken together, the results fi-om the two groups indicate that the presence 
of this polymorphic pattern correlates with an incremental decrease of 6.4-7.3 mm Hg 
relative to individuals not having this polymorphic pattern. 

The prevalence of this polymorphic pattern was 41% in this hypertensive 
population. This suggests that testing for this polymorphic pattern in hypertensive 
patients, followed by prescribing ACE inhibitors only to those patients having this 
polymorphic pattern, could increase the response rate from 43% (in a hypertensive 
population in general) to 76% in hypertensive population selected according to the 
methods of the invention. 

If even one polymorphism was absent from the pattern, the high resolution 
of the variable and therefore the predictive value would be lost. 
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Example 4: Correlation Between a Specific Polymorphism Pattern 
and Treatment Response or Predisposition to a 
Cardiovascular Syndrome 

In accordance with the methods disclosed in the above examples, additional 
5 correlations of polymorphic patterns with either responsiveness to ACE inhibitors, non- 

responsiveness to ACE inhibitors, predisposition to myocardial infarction, or 
predisposition to stroke were derived. These correlations are presented in the following 
table: 
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Prediction: 


Non-response to ACE-inhibitor treatment 


Response to ACE-inhibitor treatment 


Response to ACE-inhibitor treatment 


Response to ACE-inhibitor treatment 


Response to ACE-inhibitor treatment 


Response to ACE-inhibitor treatment 


Response to ACE-inhibitor treatment 


Response to ACE-inhibitor treatment 


Response to ACE-inhibitor treatment 


Response to ACE-inhibitor treatment 


Non-response to ACE-inhibitor treatment 


Response to ACE-inhibitor treatment 


Predisposition to MI 


Predisposition to Ml 


Predisposition to MI 


Predisposition to MI 


Predisposition to MI 


Predisposition to Ml 


Predisposition to stroke | 


Genotype3 


C/C 


C/C 


G/G 


C/C 


G/G 


A/A 




C/T 


G/G 


A/A 


A/C 


G/G 




A/C 


G/G 


A/C 


A/C 


A/A 




Position 3 


ATI:678: 


AGT:620: 


VO 

H 

a 
< 


AGT:620: 


AGT:in6: 


AT1:1271: 


AGT:620: 


AT1:678: 


AGT:1116: 


ATI:! 167: 


ATl:127l: 


AGT:1116: 




AT1:1271: 


AGR:1116: 


AT1:1271: 


AT1:1271: 


AT1:1271: 




Genotype2 


5 


A/A 


A/A 


G/G 


C/C 


G/G 


G/G 


G/G 


A/A 


A/A 


C/C 


A/A 


Aye 


A/C 


C/T 


T/T 


T/T 


C/T 




Position 2 


ATR:2046: 


AGR:1204: 


AGR:1204: 


AGR:839: 


AGT:620: 


AGT:lll6: 


AGR:1072: 


AGR:1072: 


AGR:1204: 


AGR:1204: 


AT 1:678: 


AGR:1204: 


AGR:1204: 


AGR:1204: 


AGR:620: 


ANP:449: 


AGR:449: 


AGR:620: 


AGR:395: 


Genotypel 


A/G 


B 




C/T 


C/T 


C/T 


C/T 


C/T 


G/C 


G/G 


C/T 


A/G 


G/G 


A/A 


T/T 


A/G 


/VG 


A/A 


A/A 


A/A 


Position 1 


ACE:2193: 


O 
< 


AGR:449: 


AGR:449: 


AGR:449: 


AGR:449: 


AGR:449: 


AGR:1007: 


AGR:1072: 


ACR:5496: 


ACE:2193: 


AGR:I007: 


ACE:2193: 


AGR:449: 


ACE:2193: 


ACE:2193: 


ACR:5349: 


ACE:2193: 


ACE:2193: 


Genetic 
Signature 


GSI9 


GS20 


GS21 


GS22 


GS23 


GS24 


GS25 


GS26 


GS27 


GS30 


GS33 


GS36 


GS37 


GS38 


GS39 


GS40 


a 


GS42 


GS43 



o 
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Prediction: 


Predisposition to stroke 


Predisposition to stroke 


44 

a. 

>> 
o 
c 

V 

o 


A/A 


A/G 


Position 3 


ATl:1167: 


AGR:1218: 


a. 

o 
e= 

o 


G/G 


G/G 


Position 2 


AGR:1072: 


AGR:I072: 


Genotype! 


A/G 


JL/V 


Position 1 


AGR:1007: 


AGR:395: 


Genetic 
Signature 


GS44 


GS/45 



wo 00/22166 _Q3_ PCT/IB99/01678 

Polymorphic patterns, Le., combinations of sequences at particular 
polymorphic positions, that show a statistically significant correlation with myocardial 
infarction and stroke are shown below: 

Table 12 - Correlation with MI 

MI (n=120) 
Healthy {n=100) 



Genetic Signature Prevalence Ml Healthy P-value 



ACE:2193 AA 
ANP:1204 AC 


5.0% 


100% 


0% 


0.001 


AGR:449 TT 
AGR:1204 AC 
ATI: 1271 AC 


8.2% 


89% 


11% 


0.002 


ACE:2193 AG 
AGT:620 CT 
AGT:1167 GG 


4.5% 


100% 


0% 


0.002 


ACE:2193 AG 
AGR:449 TT 
AT1:127] AC 


17.7% 


74% 


26% 


0.007 


ACR:5349 AA 
AGR:449 TT 
ATI: 1271 AC 


14.5% 


75% 


25% 


0.013 


ACE:2193 AA 
AGT:620 CT 
ATI: 1271 AA 


3.6% 


100% 


0% 


0.009 
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Table 13 - Correlation With Stroke 

Stroke (n=37) 
Healthy (n=100) 



Genetic Signature 


Prevalence 


Stroke 


Healthy 


P-value 


ACE:2193 AA 
AGR:395 AT 


2.9% 


100% 


0% 


0.005 


AGR:1007 AG 
AGR:1072 GG 
ATI: 1167 AA 


9.5% 


85% 


15% 


0.000008 


AGR:395 AT 
AGR:1072 GG 
AGR:1218 AG 


4.4% 


83% 


17% 


0.006 



The present invention is not to be limited in scope by the specific 
embodiments described herein. Indeed, various modifications of the invention in addition 
to those described herein will become apparent to those skilled in the art from the 
foregoing description and the accompanying figures. Such modifications are intended to 
fall within the scope of the appended claims. 

It is further to be understood that all base sizes or amino acid sizes, and all 
molecular weight or molecular mass values, given for nucleic acids or polypeptides are 
approximate, and are provided for description. 

Various patents, patent applications, methodologies, and publications are 
cited herein, the disclosures of which are incorporated by reference in their entireties. 

Example 5: Correlation Between a Specific Polymorphism Pattern and High 
Blood Pressure Treatment with a P-Blocicer 

Using the techniques described in Examples 1 and 2, we investigated 
polymorphisms in the ADBPl and ADBR2 genes (regulator/promoter regions and coding 
regions). The source material was the same as described above. 

At the outset, we identified a set of key polymorphisms. Some of these are 
known; others are new. The are summarized in Table 14. 
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Tabic 14 -Polymorphic Positions in the Genes Encoding 
the Beta Adrenergic Receptors 1 and 2 



GeneiLocation 


Genotypes 


Dirilloo 


A/A 


A 1^ 

AJG 


G/G 


D Ir.zj / / 


C/C 


C/T 


T/T 




A/A 


AJG 


G/G 


d1R:231 


A/A 


A/G 


G/G 


blR:75o 


C/C 


C/T 


T/T 


i31K:1251 


C/C 


C/G 


G/G 


BlR:i4U3 


A/A 


A/G 


G/G 


T3 1 n . 1 coo 

B1R:1528 


A/A 


A/C 


C/C 


B2P:934 


A/A 


A/G 


G/G 


B2P:987 


C/C 


C/G 


G/G 


B2r:lu0o 


A/A 


/WG 


G/G 


B2r:l 120 


C/C 


C/G 


G/G 


"TiOT*. t 01 t 

B2P:1221 


C/C 


C/T 


T/T 


B2r:1541 


C/C 


C/T 


T/T 


B2P:1568 


C/C 


C/T 


T/T 


B2R;o39 


A/ A . 


A/G 


G/G 


B2R:872 


C/C 


C/G 


G/G 


D2K.1045 


A/A 


A/G 


G/G 


B2R:1284 


C/C 


C/T 


T/T 


B2R:1316 


A/A 


A/C 


C/C 


B2R:lg46 


C/C 


C/G 


G/G 


B2R:2032 


A/A 


A/G 


G/G 


B2R:2068 


no insertion 


insertion G/G 


insertion C/G 


B2R:2070 


no insertion 


insertion C/C 





Abbreviations used for the denotation of the genes: 
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BIP (ADBRl) - putative promoter region of the gene encoding the beta 

adrenergic receptor 1 
B 1 R ( ADBR 1 ) - the protein coding region of the gene encoding the beta 

adrenergic receptor 1 
B2P (ADBR2) - putative promoter region of the gene encoding the beta 

adrenergic receptor 2 
B2R ( ADBR2) - the protein coding region of the gene encoding the beta 

adrenergic receptor 2 

These positions, one or a combination of several, can be used to predict a 
good response to anti-hypertensive treatment with Beta-blockers. 

Indeed, we identified various polymorphisms/polymorphic patterns that can 
be used in the prediction of high response to anti-hypertensive treatment with 
beta-blockers. These are shown in Table 15. 



Table IS - Polymorphisms Predictive of P-Blocker Responsitivity 



GS 


Positions - 
genotypes 


Prevalence 
(%) 


P-value 


Comnients 


GSl 


B2R:1846-C/C 


7.7 


0.0049 


Patients show a 8.3 mmHg DBP 
reduction compared to the group lacking 
this genotype. 


GS2 


B2R:1045-A/A 


6.6 


0.0105 


Patients will show a 8.1 mmHg DBP 
reduction compared to the group lacking 

this genotype. 


GS3 


B2R:I316-A/A 


6.6 


0.0105 


Patients that will show a 8. 1 mmHg 
DBP reduction compared to the group 
lacking this genotype. 



Not only positions in the beta adrenergic receptors, but also positions in the 
renin-angiotensin- aldosterone-system can be used to predict response to anti-hypertensive 
treatment with beta- blockers (see Table 16). The numbering give according to the 
numberings in GenBank as defined in Table 1, supra. 



Table 16 - RAAS Polymorphism Associated with P-BIocker Responsitivity 



GS 


Positions - Genotypes 


Prevalence (%) 


Comments 


GS105B 


ATR:2046 - C/C + 
AT1:678-T/T 


10 


Patients will show more that 10-mmHg 
DBP (diastolic blood pressure reduction 
compared to the group lacking this 
genotype. 
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The present invention is not to be limited in scope by the specific 
embodiments described herein. Indeed, various modifications of the invention in addition 
to those described herein will become apparent to those skilled in the art from the 
foregoing description and the accompanying figures. Such modifications are intended to 
fall within the scope of the appended claims. 

It is further to be understood that all sizes and all molecular weight or 
molecular mass values are approximate, and are provided for description. 

Patents, patent applications, procedures, and publications cited throughout 
this application are incorporated herein by reference in their entireties. 
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1 1 . A method for assessing cardiovascular status in an individual to be 

2 tested comprising comparing 

3 (a) a test polymorphic pattern comprising at least one 

4 polymorphic position within a gene encoding a polypeptide selected from 

5 the group consisting of renin, aldosterone synthase, type-2 angiotensin II 

6 receptor, endothelin receptor, and p-adrenoceptor of the individual, with 
7- (b) a reference polymorphic pattern derived from a population 

8 of individuals exhibiting a predetermined cardiovascular status; and 

9 concluding whether the individual possesses the cardiovascular status based on whether 
10 the test pattern matches the reference pattern. 

1 2. The method according to claim 1 , wherein the polymorphic pattern 

2 of (a) includes a polymorphic position within a second gene encoding a polypeptide 

3 selected from the group consisting of ACE, ATI , and AGT. 

1 3. The method according to claims 1 or 2, wherein the predetermined 

2 cardiovascular status is predisposition to a cardiovascular syndrome. 

1 4. The method according to claim 3, wherein the cardiovascular 

2 syndrome is selected from the group consisting of myocardial infarction, unstable angina, 

3 hypertension, atherosclerosis, and stroke. 

1 5. The method according to claim 3, wherein the reference pattern 

2 comprises at least two polymorphisms. 

1 6. The method according to claim 5, wherein the reference pattern 

2 comprises at least three polymorphisms. 

1 7. The method according to claims 1 or 2, wherein the predetermined 

2 cardiovascular status is a response to a cardiovascular treatment regimen. 
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1 8. The method according to claim 7, wherein the treatment regimen 

2 comprises administering a cardiovascular drug selected from the group consisting of an 

3 ACE inhibitor, an angiotensin II receptor antagonist, a diuretic, an alpha-adrenoreceptor 

4 antagonist, a cardiac glycoside, a phosphodiesterase inhibitor, a beta-adrenoreceptor 

5 antagonist, a calcium channel blocker, a HMG-CoA reductase inhibitor, an imidizoline 

6 receptor blocker, an endothelin receptor blocker, and an organic nitrite. 

1 9. The method according to claim 1 , wherein the polymorphic position 

2 of the gene is selected from the group consisting of a position in the renin coding region 

3 comprising a cytosine to thymine transition in exon 10 that creates a premature stop codon 

4 at position 387 resulting in a truncated form of renin with 20 amino acids deleted from the 

5 carboxyl terminus; a position in a promoter region of aldosterone synthase numbered -344 

6 (with the initiation codon starting at 1); positions in the regulatory region of p-adrenergic 

7 receptor-1 gene (designated BIP) numbered 2238, 2440, 2493, 2502, 2577, 2585, 2693, 

8 2724, and 2757; positions in the P-adrenergic receptor- 1 coding region (designated BIR) 

9 numbered 231, 758,1037, 1251, 1403, and 1528; positions in the P-adrenergic receptor-2 

10 gene regulatory region (designated B2P) numbered 932, 934, 987, 1006, 1 120, 1221, 

11 1541, and 1568; positions in the P-adrenergic receptor-2 gene coding region (designated 

12 B2R) numbered 839, 872, 1045, 1284, 1316, 1846, 1891, 2032, 2068, and 2070; positions 

13 in the endothelin receptor type A coding region (designated ETJ numbered 969, 1005, 

14 1 146, and 2485; and combinations of any of the foregoing, 

1 10. The method according to claim 2, wherein the polymorphic position 

2 of the second gene is selected from the group consisting of positions in the ACE 

3 regulatory region numbered 5 106, 5349, and 5496; positions in the ACE coding region 

4 numbered 375, 582, 731, 1060, 1215, 2193, 2328, 2741, 3132, 3387, 3503, and 3906; 

5 position 145 1 in the ACE gene as numbered in GenBank entry X62855; positions in the 

6 ACT regulatory region numbered 395, 412, 432, 449, 692, 839, 1007, 1072, 1204, and 

7 1218; positions in the ACT coding region numbered 273, 620, 803, 912, 997, 1116, and 

8 1 174; position 49 in the AGT gene as numbered in GenBank entry M24688; positions in 

9 the ATI regulatory region numbered 1427, 1756, 1853, 2046, 2354, 2355, and 2415; and 

10 positions in the ATI coding region numbered 449, 678, 1 167, and 1271. 
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1 1 1 . An isolated nucleic acid encoding a polypeptide selected from the 

2 group consisting of renin, aldosterone synthase, type-2 angiotensin II receptor, endothelin 

3 receptor, and P-adrenoceptor in an individual, wherein the nucleic acid comprises a 

4 polymorphic position and wherein the polymorphic position is a position in a polymorphic 

5 pattern derived from a population of individuals exhibiting a predetermined cardiovascular 

6 status. 

1 12. A probe which hybridizes at high stringency to a polymorphic 

2 position as defined in claim 1 1 . 

1 13, A library of nucleic acids comprising a nucleic acid which 

2 comprises 

3 (a) a polymorphic position of a gene encoding a polypeptide 

4 selected from the group consisting of renin, aldosterone synthase, type-2 

5 angiotensin II receptor, endothelin receptor, and P-adrenoceptor, wherein 

6 the polymorphic position is a position in a polymorphic pattern derived 

7 from a population of individuals exhibiting a predetermined cardiovascular 

8 status, and 

9 (b) nucleic acids comprising other polymorphic positions of 

10 human genes that are indicative of cardiovascular status of an individual, 

1 1 wherein the human genes are selected from the group consisting of ACE, 

12 ATI , ACT, renin, aldosterone synthase, type-2 angiotensin II receptor, 

13 endothelin receptor, and p-adrenoceptor. 

1 14. The library of claim 1 3 wherein the nucleic acids are 

2 oligonucleotides. 

1 15. A kit for assessing cardiovascular status comprising 

2 (a) sequence determination primers and 

3 (b) sequence determination reagents, 

4 wherein the primers are selected from the group consisting of primers that hybridize to or 

5 immediately adjacent to 
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1 (i) a polymorphic position in a gene gene encoding a 

2 polypeptide selected from the group consisting of renin, aldosterone 

3 synthase, type-2 angiotensin II receptor, endothelin receptor, and p- 

4 adrenoceptor, and 

5 (ii) a second polymorphic position in a gene encoding a 

6 polypeptide selected from the group consisting of ACE, ATI, AGT, 

7 renin, aldosterone synthase, type-2 angiotensin II receptor, 

8 endothelin receptor, and P-adrenoceptor, 

1 16. The kit of claim 1 5, wherein the second polymorphic position is 

2 selected from the group consisting of positions in the ACE regulatory region numbered 

3 5106, 5349, and 5496; positions in the ACE coding region numbered 375, 582, 731, 1060, 

4 1215, 2193, 2328, 2741, 3132, 3387, 3503, and 3906; position 1451 in the ACE gene as 

5 numbered in GenBank entry X62855; positions in the AGT regulatory region numbered 

6 395, 412, 432, 449, 692, 839, 1007, 1072, 1204, and 1218; positions in the AGT coding 

7 region numbered 273, 620, 803, 912, 997, 1 1 16, and 1 174; position 49 in the AGT gene as 

8 numbered in GenBank entry M24688; positions in the ATI regulatory region numbered 

9 1427, 1756, 1853, 2046, 2354, 2355, and 2415; positions in the ATI coding region 

10 numbered 449, 678, 1 1 67, and 1 27 1 ; a position in the renin coding region comprising a 

1 1 cytosine to thymine transition in exon 1 0 that creates a premature stop codon at position 

12 387 resulting in a truncated form of renin with 20 amino acids deleted from the carboxyl 

13 terminus; a position in a promoter region of aldosterone synthase numbered -344 (with the 

14 initiation codon starting at 1); positions in the regulatory region of P-adrenergic receptor-1 

15 gene (designated B IP) numbered 2238, 2440, 2493, 2502, 2577, 2585, 2693, 2724, and 

16 2757; positions in the P-adrenergic receptor-1 gene coding region (designated BIR) 

17 numbered 231, 758, 1037, 1251, 1403, and 1528; positions in the P-adrenergic receptor-2 

18 regulatory region (designated B2P) numbered 932, 934, 987, 1006, 1 120, 1221, 1541, and 

19 1568; positions in the p-adrenergic receptor-2 gene coding region (designated B2R) 

20 numbered 839, 872, 1045, 1284, 1316, 1846, 1891, 2032, 2068, and 2070; positions in the 

21 endothelin receptor type A coding region (designated ET^^) numbered 969, 1005, 1 146, 

22 and 2485; and combinations of any of the foregoing. 
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SEQUENCE LISTING 



<110> Eurona Medical AB 

Norberg, Leif Torbjorn 
Andersson, Maria Kristina 
Lindstrom, Per Harry Rutger 

<120> GENES FOR ASSESSING CARDIOVASCULAR 

STATUS AND COMPOSITONS FOR USE THEREOF 



<130> 1248/1E973-US1 

<140> TBA 
<141> 

<160> 140 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 18 
<212> DNA 

<213> "Artificial Sequence" 
<400> 1 

tgcgtgcttc agaagtcc 18 

<210> 2 

<211> 19 

<212> DNA 

<213> "Artificial Sequence" 



<210> 3 
<211> 19 
<212> DNA 

<213> "Artificial Sequence" 
<400> 3 

agccaggcag taatgacct 19 

<210> 4 
<211> 18 
<212> DNA 

<213> "Artificial Sequence" 



<400> 2 
ccagggaggt gaagaaatc 



19 



<400> 4 
gcccactgtt cccttatg 



18 



<210> 5 
<211> 18 
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<212> DNA 

<213> "Artificial Sequence" 



<400> 5 
tgccctgact gacagagc 



18 



<210> 6 
<211> 16 
<212> DNA 

<213> "Artificial Sequence" 
<400> 6 

gccctggtgt gcctgt 16 

<210> 7 
<211> 18 
<212> DNA 

<213> "Artificial Sequence" 



<210> 8 

<211> 18 

<212> DNA 

<213> "Artificial Sequence" 

<400> 8 

tgcctggata tgtgttgc 18 

<210> 9 

<211> 16 

<212> DNA 

<213> "Artificial Sequence" 



<210> 10 
<211> 18 
<212> DNA 

<213> "Artificial Sequence" 
<400> 10 

tcccctctcc ctgtacct 18 

<210> 11 
<211> 18 
<212> DNA 

<213> "Artificial Sequence" 
<400> 11 

gtgctggggt agggtaga 18 



<400> 7 
tgcctggata tgtgttgc 



18 



<400> 9 
gccctcgcct ctcact 



16 



<210> 12 
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<211> 16 
<212> DNA 

<213> "Artificial Sequence" 
<400> 12 

tccccctgac ctggct 16 

<210> 13 
<211> 16 
<212> DNA 

<213> "Artificial Sequence" 

<400> 13 

ggggcaccgt gatgtt 16 

<210> 14 
<211> 16 
<212> DNA 

<213> "Artificial Sequence" 
<400> 14 

ggggcaccgt gatgtt 16 

<210> 15 
<211> 17 
<212> DNA 

<213> "Artificial Sequence" 
<400> 15 

gccagagcct ttggttt 17 

<210> 16 
<211> 19 
<212> DNA 

<213> "Artificial Sequence" 
<400> 16 

tggaagagcc gacttacag 19 

<210> 17 
<211> 17 
<212> DNA 

<213> "Artificial Sequence" 
<400> 17 

tcccagaggc aaagagg 17 

<210> 18 
<211> 19 
<212> DNA 

<213> "Artificial Sequence" 
<400> 18 

gtttctactg cggcttcat 19 
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<210> 19 
<211> 19 
<212> DNA 

<213> "Artificial Sequence" 
<400> 19 

gtttctaccg cggcttcat 19 

<210> 20 
<211> 19 
<212> DNA 

<213> "Artificial Sequence" 
<400> 20 

tcctggaaga gggagtttc 19 

<210> 21 
<211> 19 
<212> DNA 

<213> ''Artificial Sequence" 
<400> 21 

gcaggatgag agcaacaac 19 

<210> 22 
<211> 24 
<212> DNA 

<213> "Artificial Sequence" 
<400> 22 

ctggagacca ctcccatcct ttct 24 

<210> 23 
<211> 25 
<212> DNA 

<213> "Artificial Sequence" 
<400> 23 

gatgtggcca tcacattcgt cagat 25 

<210> 24 
<211> 18 
<212> DNA 

<213> "Artificial Sequence" 
<400> 24 

cttccgtggg actcatgt 18 

<210> 25 
<211> 17 
<212> DNA 

<213> "Artificial Sequence" 
<400> 25 

tgcaccgtga ggctcta 17 
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<210> 26 
<211> 18 
<212> DNA 

<213> "Artificial Sequence" 
<400> 26 

gcccaatagg aggaagca 18 

<210> 27 
<211> 19 
<212> DNA 

<213> "Artificial Sequence" 
<400> 27 

cccaccccat ctccaagaa 19 

<210> 28 
<211> 19 
<212> DNA 

<213> "Artificial Sequence" 
<400> 28 

tccctgatgg gctgctctc 19 

<210> 29 
<211> 19 
<212> DNA 

<213> "Artificial Sequence" 
<400> 29 

caaggccctc aaccaactc 19 

<210> 30 
<211> 20 
<212> DNA 

<213> "Artificial Sequence" 
<400> 30 

ttcccacaaa agctccagtg 20 

<210> 31 
<211> 20 
<212> DNA 

<213> "Artificial Sequence" 
<400> 31 

ggctcaaaat ggcaagtgtt 20 

<210> 32 
<211> 20 
<212> DNA 

<213> "Artificial Sequence" 
<400> 32 

gggccatgtc cttctgactc 20 
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<210> 33 
<211> 19 
<212> DNA 

<213> "Artificial Sequence" 
<400> 33 

cagcctggag gggttaaga 19 

<210> 34 
<211> 20 
<212> DNA 

<213> "Artificial Sequence" 
<400> 34 

cccttctgag cgagctgagt 20 

<210> 35 
<211> 21 
<212> DNA 

<213> "Artificial Sequence" 
<400> 35 

ggccatgttg agctacttca a 21 

<210> 36 
<211> 20 
<212> DNA 

<213> "Artificial Sequence" 
<400> 36 

cctccagcct tgggtcttaa 20 

<210> 37 
<211> 20 
<212> DNA 

<213> "Artificial Sequence" 
<400> 37 

ttcccatccc agtctctggt 20 

<210> 38 
<211> 19 
<212> DNA 

<213> "Artificial Sequence" 
<400> 38 

ggcagcctgg ttgatgagt 19 

<210> 39 
<211> 23 
<212> DNA 

<213> "Artificial Sequence" 
<400> 39 

attccagctc tgaaattctc tga 23 
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<210> 40 
<211> 18 
<212> DNA 

<213> "Artificial Sequence" 
<400> 40 

gagcccctcc agcacctc 18 

<210> 41 
<211> 17 
<212> DNA 

<213> "Artificial Sequence" 
<400> 41 

acccgagcct gcccacc 17 

<210> 42 
<211> 18 
<212> DNA 

<213> "Artificial Sequence" 
<400> 42 

ggtcgggctg ggaagatc * 18 

<210> 43 
<211> 18 
<212> DNA 

<213> "Artificial Sequence" 
<400> 43 

tcggctctgc cccttctc 18 

<210> 44 
<211> 21 
<212> DNA 

<213> "Artificial Sequence" 
<400> 44 

gccctttctc cagcttcctc t 21 

<210> 45 
<211> 17 
<212> DNA 

<213> "Artificial Sequence" 

<400> 45 

cggcggcagc agcaaca 17 

<210> 46 
<211> 18 
<212> DNA 

<213> "Artificial Sequence" 
<400> 46 

gagcccctcc agcacctc 18 
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<210> 47 
<211> 17 
<212> DNA 

<213> "Artificial Sequence" 
<400> 47 

acccgagcct gcccacc 17 

<210> 48 
<211> 18 
<212> DNA 

<213> "Artificial Sequence" 
<400> 48 

ggtcgggctg ggaagatc 18 

<210> 49 
<211> 18 
<212> DNA 

<213> "Artificial Sequence" 
<400> 49 

tcggctctgc cccttctc 18 

<210> 50 
<211> 21 
<212> DNA 

<213> "Artificial Sequence" 
<400> 50 

gccctttctc cagcttcctc t 21 

<210> 51 
<211> 17 
<212> DNA 

<213> "Artificial Sequence" 
<400> 51 

cggcggcagc agcaaca 17 

<210> 52 
<211> 23 
<212> DNA 

<213> "Artificial Sequence" 
<400> 52 

atggcactta aaggtcagtt aat 23 

<210> 53 
<211> 19 
<212> DNA 

<213> "Artificial Sequence" 
c400> 53 

tacggaagcc caagaagtt 19 
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<210> 54 
<211> 18 
<212> DNA 

<213> "Artificial Sequence" 
<400> 54 

ctccccaacg gctgtctt 18 

<210> 55 
<211> 21 
<212> DNA 

<213> "Artificial Sequence" 
<400> 55 

agcagcaaca tccagttctg t 21 

<210> 56 
<211> 19 
<212> DNA 

<213> "Artificial Sequence" 
<400> 56 

tcccacgctc tctggactt 19 

<210> 57 
<211> 26 
<212> DNA 

<213> "Artificial Sequence" 
<400> 57 

ctgatctcag ctacacatgg atacta 26 

<210> 58 
<211> 20 
<212> DNA 

<213> "Artificial Sequence" 

<400> 58 

cctgtcttgg gtgactcttc 20 

<210> 59 
<211> 20 
<212> DNA 

<213> "Artificial Sequence" 
<400> 59 

ttctgggcta aatggtgaca 20 

<210> 60 
<211> 21 
<212> DNA 

<213> "Artificial Sequence" 
<400> 60 

cttgtcttcg gtgtcaagtt t 21 
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<210> 61 
<211> 18 . 
<212> DNA 

<213> "Artificial Sequence" 
<400> 61 

gggagccttg gaccacac 18 

<210> 62 
<211> 20 
<212> DNA 

<213> "Artificial Sequence" 
<400> 62 

agcctgcatg aacctgtcaa 20 

<210> 63 
<211> 17 
<212> DNA 

<213> "Artificial Sequence" 
<400> 63 

tggtgggcgt gttcaca 17 

<210> 64 
<211> 17 
<212> DNA 

<213> "Artificial Sequence" 
<400> 64 

gccagagcca gcagaga 17 

<210> 65 
<211> 18 
<212> DNA 

<213> "Artificial Sequence" 
<400> 65 

ccacattcca ggggagac 18 

<210> 66 
<211> 20 
<212> DNA 

<213> "Artificial Sequence" 
<400> 66 

cctgtcttgg gtgactcttc 20 

<210> 67 
<211> 18 
<212> DNA 

<213> "Artificial Sequence" 
<400> 67 

ccacattcca ggggagac 18 



-10- 



wo 00722166 



PCT/IB99/01678 



<210> 66 
<211> 21 
<212> DNA 

<213> "Artificial Sequence" 
<400> 68 

gtcccttcag tgccctaata c 21 

<210> 69 
<211> 21 
<212> DNA 

<213> "Artificial Sequence" 
<400> 69 

acagccagat tgaaagacac a 21 

<210> 70 
<211> 22 
<212> DNA 

<213> "Artificial Sequence" 
<400> 70 

aaccctttta ctggtcatgt ga 22 

<210> 71 
<211> 19 
<212> DNA 

<213> "Artificial Sequence" 
<400> 71 

cgctcatggg atgtgtgac 19 

<210> 72 
<211> 23 
<212> DNA 

<213> "Artificial Sequence" 
<400> 72 

tgttttcccc agtgtctatt aga 23 

<210> 73 
<211> 21 
<212> DNA 

<213> "Artificial Sequence" 
<400> 73 

gcagggtcga gttacacatt t 21 

<210> 74 
<211> 21 
<212> DNA 

<213> "Artificial Sequence" 
<400> 74 

cctcaggctg tcacacacct a 21 
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<210> 75 
<211> 21 
<212> DNA 

<213> "Artificial Sequence" 
<400> 75 

cggcttacct tctgctgtag t 21 

<210> 76 
<211> 21 
<212> DNA 

<213> "Artificial Sequence" 
<400> 76 

ctccttgaac ctgcttgtgt t 21 

<210> 77 
<211> 22 
<212> DNA 

<213> "Artificial Sequence" 
<400> 77 

gcattgaaag atgtgctgtt ct 22 

<210> 78 
<211> 25 
<212> DNA 

<213> "Artificial Sequence" 
<400> 78 

taacgactac aaaagcaagt cttac 25 

<210> 79 
<211> 19 
<212> DNA 

<213> "Artificial Sequence" 

<400> 79 

agagggcagg ggagagtct 19 

<210> 80 
<211> 18 
<212> DNA 

<213> "Artificial Sequence" 
<400> 80 

ggcagcaggg tcagaagt 18 

<210> 81 
<211> 20 
<212> DNA 

<213> "Artificial Sequence" 
<400> 81 

gctggagagg agggttacat 20 
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<210> 82 
<211> 21 
<212> DNA 

<213> "Artificial Sequence" 
<400> 82 

tgcaaacttc ggtaaatgtg t 21 

<210> 83 
<211> 19 
<212> DNA 

<213> "Artificial Sequence" 
<400> 83 

cagaacaacg gcagcttct 19 

<210> 84 
<211> 24 
<212> DNA 

<213> "Artificial Sequence" 
<400> 84 

actggctgac ttatgctttt tact 24 

<210> 85 
<211> 22 
<212> DNA 

<213> "Artificial Sequence" 
<400> 85 

gggttgaatt ttgggactca ta 22 

c210> 86 
<211> 20 
<212> DNA 

<213> "Artificial Sequence" 
<400> 86 

gccagtttgc cagctataat 20 

<210> 87 
<211> 23 
<212> DNA 

<213> "Artificial Sequence" 
<400> 87 

tgatgcctag ttgaatcaat aca 23 

<210> 88 
<211> 23 
<212> DNA 

<213> "Artificial Sequence" 
<400> 88 

gaaggcctat gaaattcaga aga 23 
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<210> 89 
<211> 22 
<212> DNA 

<213> "Artificial Sequence" 
<400> 89 

aaagtcggtt cagtccacac aa 22 

<210> 90 
<211> 22 
<212> DNA 

<213> "Artificial Sequence" 
<400> 90 

aaacagcttg gtggtgatag tc 22 

<210> 91 
<:211> 20 
<212> DNA 

<213> "Artificial Sequence" 
<400> 91 

gcaggtgact ttggctacaa 20 

<210> 92 
<211> 23 
<212> DNA 

<213> "Artificial Sequence" 
<400> 92 

aggaaacagg aaacccagta tat 23 

<210> 93 
<211> 24 
<212> DNA 

<213> "Artificial Sequence" 
<400> 93 

cctgtacgct agtgtgtttc tact 24 

<210> 94 
<211> 20 
<212> DNA 

<213> "Artificial Sequence" 

<400> 94 

ctggattccc caccaaatat 20 

<210> 95 
<211> 23 
<212> DNA. 

<213> "Artificial Sequence" 
<400> 95 

tgctccttct ttcacaaaat tac 23 
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<210> 96 
<211> 26 
<212> DNA 

<213> "Artificial Sequence" 
<400> 96 

cttccgttat tatgtgtgat attagt 26 

<210> 97 
<211> 23 
<212> DNA 

<213> "Artificial Sequence" 
<400> 97 

gcatgtacct aaaaagtcct gtc 23 

<210> 98 
<211> 22 
<212> DNA 

<213> "Artificial Sequence" 
<400> 98 

attggcatat ccatcacctt aa 22 

<210> 99 
<211> 22 
<212> DNA 

<213> "Artificial Sequence" 
<400> 99 

gatctcccaa ctcatgctat ga 22 

<210> 100 
<211> 22 
<212> DNA 

<213> "Artificial Sequence" 
<400> 100 

attggattca atttgcctac at 22 

<21G> 101 
<211> 25 
<212> DNA 

<213> "Artificial Sequence" 

<400> 101 

tttggtaata cagttgtgga tcata 25 

<210> 102 
<211> 20 
<212> DNA 

<213> "Artificial Sequence" 
<400> 102 

tgcaacttgg gtagcatgtc 20 
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<210> 103 
<211> 21 
<212> DNA 

<213> "Artificial Sequence" 
<400> 103 

agtcgtcccg tgtcaactat c 21 

<210> 104 
<211> 23 
<212> DNA 

<213> "Artificial Sequence" 
<400> 104 

cgttgtcttc cgttattatg tgt 23 

<210> 105 
<211> 25 
<212> DNA 

<213> "Artificial Sequence" 
<400> 105 

ttattgcatg tacctaaaaa gtgta 25 

<210> 106 
<211> 25 
<212> DNA 

<213> "Artificial Sequence" 
<400> 106 

gcattcatat aaagatcaaa tcagt 25 

<210> 107 
<211> 23 
<212> DNA 

<213> "Artificial Sequence" 
<400> 107 

caccctgata acaaaaccag ata 23 

<210> 108 
<211> 21 
<212> DNA 

<213> "Artificial Sequence" 

<400> 108 

ctttctggca tcaacctcac t 21 

<210> 109 
<211> 25 
<212> DNA 

<213> "Artificial Sequence" 
<400> 109 

acttttaagg acgaattaga gaact 25 



16- 



wo 00/22166 



PCT/IB99/01678 



<210> 110 
<211> 22 
<212> DNA 

<213> "Artificial Sequence" 
<400> 110 

gCccaccctt gaatttcata ac 22 

<210> 111 
<211> 18 
<212> DNA 

<213> "Artificial Sequence" 
<400> 111 

cccaacctcc tccctctc 18 

<210> 112 
<211> 19 
<212> DNA 

<213> "Artificial Sequence" 
<400> 112 

gctcgctctc cctcacgac 19 

<210> 113 
<211> 17 
<212> DNA 

<213> "Artificial Sequence" 
<400> 113 

tccagccgct ccccatc 17 

<210> 114 
<211> 19 
<212> DNA 

<213> "Artificial Sequence" 
<400> 114 

gctcgctctc cctcacgac 19 

<210> 115 
<211> 17 
<212> DNA 

<213> "Artificial Sequence" 
<400> 115 

tccagccgct ccccatc 17 

<210> 116 
<211> 22 
<212> DNA 

<213> "Artificial Sequence" 
<400> 116 

gcccctcaga taatgtaagc tc 22 
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<210> 117 
<211> 21 
<212> DNA 

<213> "Artificial Sequence" 
<400> 117 

aaccggcacg aaaactttac t 21 

<210> 118 
<211> 22 
<212> DNA 

<213> "Artificial Sequence" 
<400> 118 

gcacttcact accaaatgag ca 22 

<210> 119 
<211> 22 
<212> DNA 

<213> "Artificial Sequence" 



c210> 120 

<211> 18 

<212> DNA 

<213> "Artificial Sequence" 

<400> 120 

caggaaacag ctatgact 16 

<210> 121 

<211> 29 

<212> DNA 

<213> "Artificial Sequence" 



<210> 122 
<211> 24 
<212> DNA 

<213> "Artificial Sequence" 
<400> 122 

cgacgttgta aaacgacggc cagt 24 

<210> 123 

<211> 1278 

<212> DNA 

<213> Homo sapien 



<400> 119 
gcacttcact accaaatgag cc 



22 



<400> 121 
agtcacgacg ttgtaaaacg acggccagt 



29 



<400> 123 

ccagacaagt gatttttgag gagtccctat ctataggaac aaagtaatta aaaaaatgta 



60 
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tttcagaatt tacaggccca tgtgagatat gattttttta aatgaagatt tagagtaatg 
ggtaaaaaag aggtatttgt gtgtttgttg attgttcagt cagtgaatgt acagcttctg 
cctcatatcc aggcaccatc tcttcctgct ctttgttgtt aaatgttcca ttcctgggta 
atttcatgtc tgccatcgtg gatatgccgt ggctccttga acctgcttgt gttgaagcag 
gatcttcctt cctgtccctt cagtgcccta ataccatgta tttaaggctg gacacatcac 
cactcccaac ctgcctcacc cactgcgtca cttgtgatca ctggcttctg gcgactctca 
ccaaggtctc tgtcatgccc tgttataacg actacaaaag caagtcttac ctataggaaa 
ataagaatta taaccctttt actggtcatg tgaaacttac catttgcaat ttgtacagca 
taaacacaga acagcacatc tttcaatgcc tgcatcctga aggcattttg tttgtgtctt 
tcaatctggc tgtgctattg ttggtgttta acagtctccc cagctacact ggaaacttcc 
agaaggcact tttcacttgc ttgtgtgttt tccccagtgt ctattagagg cctttgcaca 
gggtaggctc tttggagcag ctgaaggtca cacatcccat gagcgggcag cagggtcaga 
agtggccccc gtgttgccta agcaagactc tcccctgccc tctgccctct gcacccccgg 
> cctgcatgtc cctgtggcct cttgggggta catctcccgg ggctgggtca gaaggcctgg 
gtggttggcc tcaggctgtc acacacctag ggagatgctc ccgtttctgg gaaccctggc 
cccgactcct gcaaacttcg gtaaatgtgt aactcgaccc tgcaccggct cactctgttc 
agcagtgaaa ctctgcatcg atcactaaga cttcctggaa gaggtcccag cgtgagtgtc ' 
gcttctggca tctgtccttc tggccagcct gtggtctggc caagtgatgt aaccctcctc 
tccagcctgt gcacaggcag cctgggaaca gctccatccc cacccctcag ctataaatag 
ggcctcgtga cccggccagg ggaagaagct gccgttgttc tgggtactac agcagaaggt 
aagccggggg ccccctca 

<210> 124 
<211> 1347 
<212> DNA 
<213> Homo sapien 

<400> 124 

gtccatctcc tcatctcctc ttctcataag gacacaggtc atattagatc agggctcacc 60 

ctcatggcct cattttaact taatcatctc tttaaagatc ctgtctccaa ataatggtca 120 

cattctaggt cctggggttt aggacttcaa cacgggcatt atggccgttg gggaggtagg 180 

acataattca gctgatattg tgcattttgc acttggatca tgtagatatt ttccatggag 240 

ctttgaatcc atttcttctt ttttttgtag acatgaatga tttattctgg gctaaatggt 300 

gacaggaata ttgagacaat gaaagatctg gttagatggc acttaaaggt cagttaataa 360 

ccacctttca ccctttgcaa aatgatattt caggtatgcg gaagcgagca ccccagtctg 420 

agatggctcc tgccggtgtg agcctgaggg ccaccatcct ctgcctcctg gcctgggctg 480 

gcctggctgc aggtgaccgg gtgtacatac accccttcca cctcgtcatc cacaatgaga 540 

gtacctgtga gcagctggca aaggccaatg ccgggaagcc caaagacccc accttcatac 600 

ctgctccaat tcaggccaag acatcccctg tggatgaaaa ggccctacag gaccagctgg 660 

tgctagtcgc tgcaaaactt gacaccgaag acaagttgag ggccgcaatg gtcgggatgc 72 0 

tggccaactt cttgggcttc cgtatatatg gcatgcacag tgagctatgg ggcgtggtcc 780 

atggggccac cgtcctctcc ccaacggctg tctttggcac cctggcctct ctctatctgg 840 

gagccttgga ccacacagct gacaggctac aggcaatcct gggtgttcct tggaaggaca 900 

agaactgcac ctcccggctg gatgcgcaca aggtcctgtc tgccctgcag gctgtacagg 960 

gcctgctagt ggcccagggc agggctgata gccaggccca gctgctgctg tccacggtgg 1020 

tgggcgtgtt cacagcccca ggcctgcacc tgaagcagcc gtttgtgcag ggcccggctc 1080 

tctatacccc tgtggtcctc ccacgctctc tggacttcac agaactggat gttgctgctg 1140 

agaagattga caggttcatg caggctgtga caggatggaa gactggctgc tccccgatgg 1200 

gagccagtgt ggacagcacc ctggctttca acacctacgc ccacttccaa ggcaaggcaa 126 0 

acctctctgc tggctctggc cctaggactt agtatccatg tgtagctgag atcagccagt 1320 

caggccttgg agatgggcag ggggcag 134 7 

<210> 125 
<211> 377 
<212> DNA 



120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900. 
960 
1020 
1080 
1140 
1200 
1260 
1278 
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<213> Homo sapien 



<400> 125 

cctgcccctg tcttgggtga ctcttccctc cctgtctcct 
aagggcttct ccctgctggc cgagccccag gagttctggg 
tctgttccca tgctctctgg catgggcacc ttccagcact 
ttctcggtga ctgaagtgcc cttcactgag agcgcctgcc 
tatgcctctg acctggacaa ggcggagggt ctcactttcc 
atgaagaaac tgtctccccg gtagagccct cccggtctcc 
ctctcctgac ccaggct 

<210> 126 
<211> 273 
<212> DNA 
<213> Homo sapien 

<400> 126 

cctctgggag agccctcact gtgcggcctg gagccttcct aactgtgcat catctcccca 60 

ggaccatcca cctgaccatg ccccaactgg tgctgcaagg atcttatgac ctgcaggacc 120 

tgctcgccca ggctgagctg cccgccattc tgcacaccga gctgaacctg caaaaattga 180 

gcaatgaccg catcagggtg ggggaggtat ttaccttcct tgcctacctg gtccattgca 240 

caggtgagca tgattaagga aaagagctat ggt ' 273 

<210> 127 
<211> 945 
<212> DNA 
<213> Homo sapien 



gtctgatttc agggaagatg 60 

tggacaacag cacctcagtg 120 

ggagtgacat ccaggacaac 180 

tgctgctgat ccagcctcac 24 0 

agcaaaactc cctcaactgg 300 

cctggaatgt gggagccaca 360 

377 



<400 
agcccaccgc 
tcctgatcct 
cggatgagag 
tgaccctgaa 
tcctgggccg 
gcctggcaag 
ggacaaatca 
gagctggaaa 
gcctgcagcg 
gggaggcaag 
agcttgtttg 
aaaattaaag 
aacatgacct 
ttgaacaata 
ggttatttct 
ctttattaga 



> 127 
cggccctcta 
gatgtctggt 
agagcccaca 
ccgcccattc 
cgtggccaac 
gcctctgccc 
gcgatgtgtc 
gcagccgttt 
gcacaaatgc 
aaccagtgtt 
tgaaacaaaa 
tatacatttt 
ccgtgtagtg 
cgtgaaagat 
cccttgtgtt 
cgccaacaga 



gccctcacga 
cctttgcagg 
gagtctaccc 
ctgtttgctg 
ccgctgagca 
ctggcctttg 
acccccagtc 
ctccttggtc 
acctcccagt 
tagcgcggga 
aagtgttccc 
tgcattgcct 
tctgtaatac 
gcaagcacct 
agtaataaac 
tgtatacatt 



ccctgggtca 
tgctgaacag 
aacagcttaa 
cgtatgatca 
cagcatgagg 
aggcaaaggc 
tcccaccttt 
taagtgtgct 
ttgctgggtt 
ctactgttcc 
ttttcaagtt 
tcggtttgta 
cttagttttt 
gaatttctgt 
gtcttgccac 
cagccagata 



cccatgcgcc 
catttttttt 
caagcctgag 
aagcgccact 
ccagggcccc" 
cagcagcaga 
tcttctaatg 
gcatggagtg 
tattttagag 
aaaaagaatt 
gagaacaaaa 
tttagtgtct 
tccacagatg 
ttgaatgcgg 
actaagcctc 
gactg 



ctcagaatga 
gagcttgaag 
gtcttggagg 
gccctgcact 
agaacacagt 
taacaacccc 
agtcgacttt 
agcagtagaa 
aatgggggtg 
ccaaccgacc 
attgggtttt 
tgaatgtaag 
cttgtgattt 
aacaatagct 
caaatttact 



<210> 128 
<211> 1856 
<212> DNA 
<2I3> Homo sapien 

<400> 128 

gtgagagctc atgtgcaggc tgagtgagag gcgagggctg ggactggcat ggggcccggg 
ggtgctgggt gagagcacag agttgggccc ccctcgctct cggggtcagc gtgcccagga 
aatgcccttt cttgttttcc acgagggggg cttctctgcc cactgagagc cggcacctac 



60 
120 
180 

240 
300 
360 
420 
480 
54 0 
600 
660 
720 
780 
840 
900 
945 



60 
120 
180 
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ttcataccat gccccgatca cccgcccctc cctcagaacc gccctctgct caagggtgtc 240 

cactctctcc tgtcctctct gcatgccgcc cctcagagca gcgggatctc aaagttatat 300 

ttcacgggct tggactccaa acggggggaa ctcggggaca ctagctcccc ccggcctcct 360 

ttcgcgaccc tgcccttgac uccctcacct tctctgtctt Ccctgagccc ctctcccagc 420 

atgtgactga taaggaaatt gagtcacaca gcccctgaaa gcgccagact agaacctgag 480 

cctctgattc ctctcacttc cctcccctac cctgccactt cctactggat agaagtagac 540 

agctcttgac tgtcctcttt tctccccact ggctggtcct tcttagcccc agcccgtttg 600 

aaagagctca cccccgacac aaggacccgc acacagatac ctcccagctc cctctcaacc 660 

caccctttcc agggttggag aacttgaggc ataaacattc utccatgagg aatctccacc 720 

cagaaatggg tctttctggc ccccagccca gctcccacat tagaacaatg acaaatagaa 780 

ggggaaacgg aaaataaaca ggagaaacgg ttttcccagg acagggtttg gcctacaagt 840 

tgtggatgtg ggtacccatg ccaagtgtga ggggaggctg gccgggcgtg gtggctcatg 900 

ctctaatccc agcactttgg gaggccaagg tgagtagatc acttgaggcc gggagcttga 960 

gaccagcctg gccaacatgg cgaaacccca tctgtactaa aaatacaaaa gttagctggg 1020 

cgtggtggta gatgcctgta gccccagcta cttgggaggc tgaggcatga gaatcgcttg 1080 

agcccagcca gggcaataca gcaagacccc gtctctacaa ataaaataca aaaaattagt 114 0 

tggacgtggt ggtgcatgcc tgtagtccta gctgctaggg aggctgagat ggaaggattg 1200 

cttgagcctg ggaggtcaag gccgcagtga gccgagatgg cgccactgca ctccagcctg 126 0 

ggcaacagag tgagaccctg cctcagaaag aaaaaaaaaa aaaaaggaga ggagagagac 1320 

tcaagcacgc ccctcacagg actgctgagg ccctgcaggt gtctgcagca tgtgcccagg 13 80 

ccggggactc tgtaagccac cgctggagac cactcccatc ctttctccca tttctctaga 1440 

cctgctgcct atacagtcac cctttttttt tttttgagac ggagtctcgc tctgtcgccc 1500 

aggctggagt gcagtggcgg gatctcggct cactgcaacg tccgcctccc gggttcacgc 1560 

cattctcctg cctcagcctc ccaagtagct gggaccacag cgcccgccac tacgcccggc 1620 

taattttttg tatttttagt agagacgggg tttcaccgtt ttagccggga tggtctcgat 1680 

ctcctgacct cgtgatccgc ccgcctcggc ctcccaaagt gctgggatta caggcgtgat 1740 

acagtcactt ttatgtggtt tcgccaattt tattccagct ctgaaattct ctgagctccc 1800 

cttacaagca gaggtgagct aagggctgga gctcaagcca ttcaaccccc taccag 1856 

<210> 129 
<211> 4020 
<212> DNA 
<213> Homo sapien 

<400> 129 

gccgagcacc gcgcaccgcg tcatgggggc cgcctcgggc cgccgggggc cggggctgct 60 

gctgccgctg ccgctgctgt tgctgctgcc gccgcagccc gccctggcgt tggaccccgg 120 

gccgcagccc ggcaactttt ctgctgacga ggccggggcg cagctcttcg cgcagagcta 180 

caactccagc gccgaacagg tgctgttcca gagcgtggcc gccagctggg cgcacgacac 24 0 

caacatcacc gcggagaatg caaggcgcca ggaggaagca gccctgctca gccaggagtt 3 00 

tgcggaggcc tggggccaga aggccaagga gctgtatgaa ccgatctggc agaacttcac 360 

ggacccgcag ctgcgcagga tcatcggagc tgtgcgaacc ctgggctctg ccaacctgcc 420 

cctggctaag cggcagcagt acaacgccct gctaagcaac atgagcagga tctactccac 480 

cgccaaggtc tgcctcccca acaagactgc cacctgctgg tccctggacc cagatctcac 540 

caacatcctg gcttcctcgc gaagctacgc catgctcctg tttgcctggg agggctggca 600 

caacgctgcg ggcaccccgc cgaaaccgct gtacgaggat ttcactgccc tcagcaatga 660 

agcctacaag caggacggct ccacagacac gggggcctac tggcgctcct ggtacaactc 720 

ccccaccttc gaggacgatc cggaacacct ctaccaacag ctagagcccc tctacctgaa 780 

cctccacgcc ttcgcccgcc ccgcactgca tcgccgatac ggagacagat acatcaacct 840 

caggggaccc atccctgctc atctgctggg agacatgtgg gcccagagct gggaaaacat 900 

ctacgacatg gtggtgcctc ccccagacaa gcccaacctc gatgtcacca gcaccatgct 960 

gcagcagggc tggaacgcca cgcacatgtt ccgggtggca gaggagttct tcacctccct 1020 

ggagctctcc cccatgcctc ccgagctctg. ggaagggtcg atgctggaga agccggccga 1080 

cgggcgggaa gtggtgtgcc acgcctcggc ctgggacttc tacaacagga aagacttcag 1140 

gatcaagcag tgcacacggg ccacgatgga ccagctctcc acagtgcacc acgagatggg 1200 
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ccatatacag 
ccccggcttc 
tctgcacaaa 
cttgctaaaa 
gtggcgctgg 
gtatcttcga 
tgatgctgga 
ttttgtcctg 
actgcaccag 
gcaggctggc 
cctggatgcc 
gaaccagcag 
tgacaactac 
ggaggaatat 
ctacaacacc 
agccaaccac 
gaacaccact 
tgcccaggag 
ggccactgtg 
gatggccaca 
ggcggggaga 
ccggctcaat 
cctggagcaa 
tgcctacgtg 
gcccattcct 
cttggtggtg 
gggctggacg 
gctgcccgtg 
ggaggtggtc 
gcagtgcacc 
ccagtatttc 
cttccatgag 
cagtctcaac 
gaagatggcc 
ctggagggta 
caggctgaag 
aggggccaag 
catccagttc 
caagtgtgac 
gggcttcagt 
cgcctcggcc 
gctgcatggg 
agaagggccc 
gcaggcccgc 
cctgggcctc 
cgggccccag 
ggccctgccc 



tactacctgc 
catgaggcca 
atcggcctgc 
atggcactgg 
ggggtcttta 
accaagtatc 
gctaagtttc 
cagttccagt 
tgcgacatct 
tcctccaggc 
cagccgctgc 
aacggcgagg 
ccggagggca 
gaccggacat 
aacatcacca 
accctgaagt 
atcaagcgga 
ctggaggagt 
tgccacccga 
tcccggaaat 
gccatcctcc 
ggctatgtag 
gacctggagc 
cgccgggccc 
gctcacctgc 
cccttccctt 
cccaggagga 
cctcctgagt 
tgccacgcct 
accgtgaact 
atgcagtacd 
gccattgggg 
ctgctgagca 
cttgacaaga 
tttgatggaa 
taccagggcc 
ttccacattc 
cagttccacg 
atctaccagt 
aggccgtggc 
atgttgagct 
gagaagctgg 
ctcccagaca 
gtgggccagt 
agccagcggc 
ttcggctccg 

aagggcctcc 



agtacaagga 
ttggggacgt 
tggaccgtgt 
aaaaaattgc 
gtgggcgtac 
aggggatctg 
atgttccaaa 
tccatgaagc 
accggtccac 
cctggcagga 
tcaagtactt 
tcctgggctg 
tagacctggt 
cccaggtggt 
cagagaccag 
acggcaccca 
tcataaagaa 
acaacaagat 
atggcagctg 
atgaagacct 
agttttaccc 
atgcagggga 
ggctcttcca 
tgcaccgtca 
tggggaacat 
cagccccctc 
tgtttaagga 
tctggaacaa 
cggcctggga 
tggaggacct 
aagacttacc 
acgtgctagc 
gcgagggtgg 
tcgcctttat 
gcatcaccaa 
tctgcccccc 
cttctagcgt 
aggcactgtg 
ccaaggaggc 
cggaagccat 
acttcaagcc 
gctggccgca 
gcggccgcgt 
ggctgctgct 
tcttcagcat 
aggtggagct 
caccagagac 



tctgcccgtc 
gctggcgctc 
caccaatgac 
cttcctgccc 
ccccccttcc 
tcctcctgtt 
tgtgacacca 
cctgtgcaag 
caaggcaggg 
ggtgctgaag 
ccagccagtc 
gcccgagtac 
gactgatgag 
gtggaacgag 
caagattctg 
ggccaggaag 
ggttcaggac 
cctgttggat 
cctgcagctc 
gttatgggca 
gaaatacgtg 
ctcgtggagg 
ggagctgcag 
ctacggggcc 
gtgggcgcag 
gatggacacc 
ggctgatgat 
gtcgatgctg 
cttctacaac 
ggtggtggcc 
tgtggccttg 
cctctcagtg 
cagcgacgag 
ccccttcagc 
ggagaactat 
agtgcccagg 
gccttacatc 
ccaggcagct 
cgggcagcgc 
gcagctgatc 
gctgctggac 
gtacaactgg 
cagcttcctg 
cttcctgggc 
ccgccaccgc 
gagacactcc 
tgggatggga 



tccctgcgtc 
tcggtctcca 
acggaaagtg 
tttggctacc 
cgctacaact 
acccgaaacg 
tacatcaggt 
gaggcaggct 
gccaagctcc 
gacatggtcg 
acccagcggc 
cagtggcacc 
gctgaggcca 
tatgccgagg 
ctgcagaaga 
tttgatgtga 
ctagaacggg 
atggaaacca 
gagccagatc 
tgggagggct 
gaactcatpa 
tctatgtacg 
ccactctacc 
cagcacatca 
acctggtcca 
acagaggcta 
ttcttcacct 
gagaagccaa 
ggcaaggact 
caccacgaaa 
agggagggtg 
tctacgccca 
catgacatca 
tacctcgtcg 
aaccaggagt 
actcaaggtg 
aggtactttg 
ggccacacgg 
ctggcgaccg 
acgggccagc 
tggctccgca 
acgccgaact 
ggcctggacc 
atcgccctgc 
agcctccacc 

tgaggtgacc 

acactggtgg 



ggggggccaa 

ctcctgaaca 
acatcaatta 
tggtggacca 
tcgactggtg 
aaacccactt 
actttgtgag 
atgagggccc 
ggaaggtgct 
gcttagatgc 
tgcaggagca 
cgccgttgcc 
gcaagtttgc 
ccaactggaa 
acatgcaaat 
accagttgca 
cagcgctgcc 
cctacagcgt 
tgacgaatgt 
ggcgagacaa 
accaggctgc 
agacaccatc 
tcaacctgca 
acctggaggg 
acatctatga 
tgctaaagca 
ccctggggct 
ccgacgggcg 
tccggatcaa 
tgggccacat 
ccaaccccgg 
agcacctgca 
actttctgat 
atcagtggcg 
ggtggagcct 
actttgaccc 
tcagcttcat 
gccccctgca 
ccatgaagct 
ccaacatgag 
cggagaacga 
ccgctcgctc 
tggatgcgca 
tggtagccac 
ggcactccca 
cggctgggtc 
gcagctgagg 



1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1960 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 



<210> 130 
<211> 2720 
<212> DNA. 
<213> Homo sapien 

<400> 130 

aagcttgctg gggttttgat agagattttg tttaacctgt agaccatttg aagattaatg 



60 
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ccattgtaac gatattaaat ctttcaatcc aagaacatgg aatgtcattc catttattta 120 

ggtctacctt atttcaacaa ttctttttgt ttgttttcag actacaagtt ttagatcctt 180 

ttgttaaatt tatttcttag ggtttttttt gttttgtttt gttttgttgg ttggttggtt 240 

tgttttgaga tggagtctca ctctgtcacc caggctggag tgcagtggca caatctcagc 300 

tcacagcaac ctctacctcc tgggttcaag cgattattct gcctcagcct cctcctcctg 360 

agtagctgga actacaggca tgcaccacca cgcctggcct tttttttttt tttttctttt 420 

gcatttttag tagagacagg gtttcacgat gttggccagc ctggtctcga atccctgacc 480 

ttgtgattca cccacctcgg cctcccaaag tgctgagatt acaggagtga gccactacac 540 

caggtcattt cttgatattt ttactctttt gatctatagt aagtaaaatt gtttttatct 600 

ttgaattttt aaatttttaa cacagttcaa atcagtgtgt ctgatttcat ctccttctct 660 

aacaaaccag ggtgccagaa ctgcttcagt ttctctgcct tctctttgtc tatgatgact 720 

aatgtacgaa ggtatctgct gcatcaaact ttaaacttca cattatcctt atttctcttg 780 

accttgacag atctggcatc ttttcacctg gtcgtaagca gaaagtcctt gatctcctta 840 

actttttgag gcatggcagc atgtgaggca gggagaggac acagacccac acagcaagtg 900 

gtgagaagcc aacagtggaa tcgttttctt aattccattt gttgattgtt tattgctagt 960 

gtatagaaat acaactgatt tttgtatatt gatcttgtat tctaaaaact tgctcaactt 1020 

gtttcttagt tctaatagtt aattaattga ttccttaggg ctttttaata caagatcatg 1080 

tcatctacaa atagaaattg ttttactttc tttctaatct ggatgccatt tatctttttt 1140 

tcttgtccaa ttgccctcac tagaaccttt agtacaaagt taaatagaaa tgggaagact 1200 

agacattttg tcttgttcct gatcttagac ataaaaacgt tgtcttccgt tattatgtgt 1260 

gatattagtt aagttaagtt tttcataaat aaacttcaca gtttgaggaa gttcctattc 1320 

ctaatttgtt gagtgttagc atgaaaaagt gttgaatttt gtccaagagt ttttaaaaat 1380 

ttttttagac aatcatgtag gctttgtcca ttttttactt ctttaaattt attttatttg 1440 

atacacaata gatgtacact ttttaggtac atgcaataat ttaatgcccc tcactataaa 1500 

ttcggagctg cctcctcgcc gatgattcca gcgcctgaca gccaggaccc caggcagcag 1560 

cgagtgacag gactttttag gtacatgcaa taatttaatg cattcatata aagatcaaat 1620 

cagtgcaatt ggcatatcca tcaccttaaa tatttgtctt tttcttcatg ctagaaacat 1680 

tcaagttatt ttctcctagc tactctgaaa tatacaatag attactgtaa actacagtca 1740 

ccctactcac ctatctaaca ttaattgatt tttggtaaac taatctaatc ttgctttctg 1800 

gcatcaacct cacttgacca tggtgtatag tccctttcat atgttattgg attcaatttg 1860 

cctacatttt gttgagaatt tttatctata ctcttaagaa atattgatct gtagtctcgt 1920 

gatgtcttta tctggttttg ctatcagggt gatactggcc tcatagcatg agttgggaga 1980 

tcatccttac tcttctattt tttggaagag tttgtgaaga attgatatta tttcttcttt 204 0 

aaatatttat tgggttttta aaatacattt ttaaaatgca acttgggtag catgtccaat 2100 

aggaacaaat gagtgtccac ccttgaattt cataaccctc ggaattaatc catgtaatct 2160 

atgatccaca actgtattac caaagttcga gttactcata ggaaagagaa agaagttctc 2220 

taattcgtcc ttaaaagttt tccaagttca gaaaaaaaaa atgttgaaga acacgaactc 2280 

ccgcaggaaa tgatactcct gtacccccag ctcgctctcc ctcacgaccc ctcgctaggc 2340 

g999ttcggg accaggtgaa cgctgatctg atagttgaca cgggacgact gtggcaccat 2400 

ccttgctgcc gtcaatatcc cgagagggag gaggttgggc cgggagggtc tccggggcgg 24 60 

ggcggaggag gagggaatgc aaaacagagc ctcgtccccg gaacccaaga agcagcaacg 252 0 

cccctcacta taaattcgga gctgcctcct cgccaatgat tccagcgcct gacagccagg 2580 

accccaggca gcagcgagtg acaggacgtc tggaccggcg cgccgctagc agctctgccg 264 0 

ggccgcggcg gtgatcgatg gggagcggct ggagcggacc cagcgagtga gggcgcacag 2700 

ccgggacgcc gaggcggcgg 2720 

<210> 131 
<211> 480 
<212> DNA 
<213> Homo sapien 

<400> 131 

gaattctcag agctggcgaa acagtctggt ccaagcagcc tctcagcagt gcctttcagc 60 

ctcccctctc tgagtctttc caccccttgc tggtacttta gtctcttcca cttctagcac 120 

cacgtgtagt ttcccaattt ctcttaccca aatttgctca cagggaaaaa aataaattaa 180 



-23- 



wo 00/22166 



PCT/IB99/0I678 



attagccatt tacaccacag tgtgaactta ataacaccaa caaaagttcc aaagctctag 240 

ggtctcatag cacctccaga tccatgatct cattcggtgt ttccaacaat gttttgcacc 300 

aaactggaca catgcttgct acttcatcat cctcatcgtg aacattatta ttattatcat 360 

cattctccag atgaagaaaa cgaatcacaa gtcaactgac agcccaaagg ctccacagct 420 

cagaggaggt aaatcatgtg cttaattcag aacttttggc tcccatcact atgctcttcc 480 

<210> 132 

<211> 2268 

<212> DNA 

<213> Homo sapien 

<400> 132 

accccaggca gcagcgagtg acaggacgtc tggaccggcg cgccgctagc agctctgccg 60 

ggccgcggcg gtgatcgatg gggagcggcc ggagcggacc cagcgagcga gggcgcacag 120 

ccgggacgcc gaggcggcgg gcgggagacc cgcaccagcg cagccggccc tcggcgggac 180 

gtgacgcagc gcccggggcg cgggtttgat atttgacaaa ttgatctaaa atggctgggt 240 

ttttatctga ataactcact gatgccatcc cagaaagtcg gcaccaggtg tatttgatat 300 

agtgtttgca acaaattcga cccaggtgat caaaatgatt ctcaactctt ctactgaaga 360 

tggtattaaa agaatccaag atgattgtcc caaagctgga aggcataatt acatatttgt 420 

catgattcct actttataca gtatcatctt tgtggtggga atatttggaa acagcttggt 4 80 

ggtgatagtc atttactttt atatgaagct gaagactgtg gccagtgttit ttcttttgaa 540 

tttagcactg gctgacttat gctttttact gactttgcca ctacgggotg tctacacagc 600 

tatggaatac cgctggccct ttggcaatta cctatgtaag attgcttcag ccagcgtcag 660 

tttcaacctg tacgctagtg tgtttctact cacgtgtctc agcattgatc gatacctggc 720 

tattgttcac ccaatgaagt cccgccttcg acgcacaatg cttgtagcca aagtcacctg 780 

catcatcatt tggctgctgg caggcttggc cagtttgcca gctataatcc atcgaaatgt 840 

atttttcatt gagaacacca atattacagt ttgtgctttc cattatgagt cccaaaatcc 900 

aacccttccg atagggctgg gcctgaccaa aaatatactg ggtttcctgt ttccttttct 960 

gatcattctt acaagttata ctcttatttg gaaggcccta aagaaggctt atgaaattca 1020 

gaagaacaaa ccaagaaatg atgatatttt taagataatt atggcaattg tgcttttctt 1080 

tttcttttcc tggattcccc accaaatatt cacttttctg gatgtattga ttcaactagg 114 0 

catcatacgt gactgtagaa ttgcagatat tgtggacacg gccatgccta tcaccatttg 1200 

tatagcttat tttaacaatt gcctgaatcc ccttttttat ggctttctgg ggaaaaaatt 1260 

taaaagatat tttctccagc ttctaaaata tattccccca aaagccaaat cccactcaaa 1320 

cctttcaaca aaaatgagca cgctttccta ccgcccctca gataatgtaa gctcatccac 13 8 0 

caagaagcct gcaccatgtt ttgaggttga gtgacatgtt cgaaacctgt ccataaagta 144 0 

attttgtgaa agaaggagca agagaacatt cctctgcagc acttcactac caaatgagca 1500 

ttagctactt ttcagaattg aaggagaaaa tgcattatgt ggactgaacc gacttttcta 1560 

aagctctgaa caaaagcttt tctttccttt cgcaacaaga caaagcaaag ccacattttg 1620 

cattagacag atgacggctg ctcgaagaac aatgtcagaa actcgatgaa tgtgttgatt 1680 

tgagaaattt cactgacaga aatgcaatct ccctagcctg cttttgtcct gttatttttt 1740 

atttccacat aaaggtattt agaatatatt aaatcgttag aggagcaaca ggagatgaga 1800 

gttccagatt gttctgtcca gtttccaaag ggcagtaaag ttttcgtgcc ggttttcagc 1860 

tattagcaac tgtgctacac tcgcacctgg tactgcacat cctgtacaaa gatatgctaa 1920 

gcagtagtcg ccaagttgca gacctttttg tgaaattcaa cctgtgtctt ataggtttac 1980 

actgccaaaa caatgcccgc aagatggctt atttgtataa cggtgttact aaagtcacat 2040 

ataaaagtta aactacttgt aaaggtgctg cactggtccc aagtagtagc gtcctcctag 2100 

tacattagtt cgattcaata tctgagaagt gtatatagtt cgcggtaaaa agattatata 2160 

tcataaagta tgccttcctg tttaaaaaaa gtatatattc cacacatata tatatatgta 2220 

catccacatc tctaaaccgc cgtnaattga ctaaaatctg gcaaagtt 2268 

<210> 133 
<211> 826 
<212> DNA 
<213> Homo sapien 



-24- 



wo 00/22166 



PCT/IB99/01678 



<400> 
gatctaccca 
ctctactctt 
tcacccatgt 
gccaaagtta 
gctggaaacc 
ctcatcagcc 
gtcactggac 
tccggaaacc 
actgccctgc 
atgcacggag 
gagagagccc 
gctgctgctg 
acggtaattg 
aacaccctgg 



133 
ccttggcctc 
ataattaaac 
tctagatggg 
atatagagaa 
taggttttac 
tctgcctgct 
acaagattgc 
tgggtaccct 
catctacccc 
tgtataaaag 
cagaccgagg 
ctctggggct 
gtaactcagg 
cttttccaca 



ccaaagtgct 
cagctgttgc 
aaaactgggc 
tggagcttcc 
gcttgtccca 
ccaggggtca 
tttcccacag 
tcacccacct 
agggtaataa 
gggaagggct 
gaagcatgga 
cctgtacctt 
cagagaaggg 
tgcggtgtca 



gggacaggtg 
ttttcctgcc 
tgtagctggg 
agggtatagg 
gttttgatgt 
cagggccaag 
ctgtccttcc 
agctctgtcc 
atcagggcag 
aagggagcca 
tggatggaga 
tggtctcccg 
gtgggcaggg 
ttcagtcctt 



tgagccacca 
aagaaaccag 
agaggccagt 
ggttgggtct 
tagccctgac 
ccagatagag 
tccagcccct 
cgcagtgaga 
agcagaattg 
cagaacctca 
aggatgcctc 
acagacacca 
gtgtaggttc 
acgatc 



tgcctggccc 
tcatgaagat 
cagggacaaa 
gggctaggga 
agtgccgttt 
ggctgctagc 
ctgctcccca 
tttattgctg 
caatcacccc 
gnggatctca 
gctggggact 
ccacctttaa 
ccaccttccc 



<210> 134 



} <211> 6910 
<212> DNA 
<213> Homo sapien 



<400> 134 



ggatcctgca 
accaggagga 
tcacatggaa 
gtggacattt 
taaagtctat 
ccattttact 
catggacccc 
cagcaatggg 
gccttgacct 
tatgtttcca 
gagcattgga 
cctgcaaagg 
gtttgaagcc 
ggagcagggt 
cattttcagg 
gcccaggacc 
cctttggagg 
gatagtgcag 
tgaaaggttc 

gggagggtgg 

accccgtcca 
ggtgtgtgtg 
ctgcaggatg 
cgtgttcttg 
gggtgcaggg 
ggtgggcagt 
cactgcgggt 
aagacgcaga 
gctgcttcct 
agaaggggac 
gccgcagcct 
gacagcccgg 
aagcaaggca 



aggagggata 

gaccccatgt 
ccagtgcgct 
tctgcagttt 
taaaagaatc 
cctctcagcc 
cagtccagac 
gaattaggca 
tcgctctgag 
gagcaggttc 
atggcactca 
gcacgggcac 
atgccccagc 
tatgagcacc 
taaagccctc 
ctgccaggca 
atggggaagg 
agaaaacccc 
cagggagcca 
acaggagaca 
gcagggcctc 
atgctgccgg 
atcctggagc 
ttgtaagcgg 
gggaggcgag 
gagggtgagc 
tggggaagca 
gcaatgagcc 
cttgcggtct 
ttccactggc 
ccctcagggc 
ggcaggaggc 
caaggcgggg 



caaattacat 
gactccagga 
cctgtggtgg 
ttgatcaatt 
caaggctccc 
ctggagaaag 
cccacgcctt 
cctgacttct 
agtctcaggc 
ctgggtgaga 
gggcaaaggc 
tgggcactag 
atccaggcaa 
tgcacctgga 
cctggccctc 
ctcagcactg 
agtgcagcac 
agctcactgc 
gcaggagggc 
ctttggattg 
ctgcttggcc 
aggatgtgga 
cctgggtggc 
cgagttggga 
agaggtagga 
acccggccca 
gggcggtggt 
cttctgtgta 
ggggactgtc 
ccaggtcaca 
gcccagggtc 
aggcggggac 
cggctgatgg 



acatttgtca 
ccctggttga 
agggtgtacc 
ttgcaatgaa 
tctcatctca 
gagaggccag 
ttctcagcat 
ccttcatcta 
aggtccagag 
taaaaggatt 
agaggtgtgc 
agccgctcgg 
caggtggctg 
gatgcaccag 
gctgggaaca 
ccattcccag 
atgctggtct 
agagagggca 
tttagctgtg 
ggactgcagg 
ccacaggtac 
gaagctgcaa 
ctacagacaa 
gctgagagct 
cccaaaagca 
gaggacggcc 
ggagaaatgg 
gtgagaaccc 
cttcccatag 
ggctgagtgc 
cctgcagtcc 
gcaggtggct 
cactggggag 



aaacccacag 
caacaacgta 
tgtgtcaggg 
ctaaatctgt 
cgataagata 
gtcccaccac 
cctcagacca 
cctttggctg 
ccagttctcc 
tgggctgaac 
gtggcagcgc 
gcccctagga 
aggctgctgc 
accttccagg 
cccagatccc 
caggtcccgg 
gtggtgctgc 
ggactcagaa 
aagccgctaa 
gcggggccac 
aacttgggag 
caggtggaca 
catcgtgggc 
gggagcaggg 
catctgccct 
accctgtggg 
gcacgggcac 
gctctgcacc 
gccagaaaac 
cgagcctggt 
cggcaaacct 
ggtggttccg 
gatgtttcct 



cacgttgacc 
tcgagattcc 
cagggggtac 
ggtataaaaa 
aagtccccat 
cttccaccag 
gcaggacttg 
ggggcctcca 
catgacgtga 
agggtggagg 
cctggctgtc 
cggtgctgcc 
agatctggag 
agctggggcc 
tgcccctgct 
cactctgcat 
cagggcaggg 
gcactaaagt 
tccaggagca 
gagggacatg 
gaccacgcat 
gcctgcatcc 
acaaatgtgg 
tgggcagcct 
gggcccctgt 
gtcgcgtctg 
ctctgcagag 
aacctcggcg 
tgaggccctg 
gttcgccggg 
tcctgatggg 
ttgttctcag 
ggcccgtgga 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
826 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
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gagggtggcg cctggtcagg tgggcaggga gaggctgatg cttggagtcg gtcacctgca 204 0 

gggatgttgt cattaggacg ggggaaggac tggatgagga tgtcacagtg gtgacagccc 2100 

ccactccatg gtaggaaggg aacgctattg ggaatagtgg ggtttaggta aaagggcacc 2160 

cgtgggtcgg ggccttcact gaggctggcc tatagatgac atctgggaga gagtcaggac 2220 

ccaggaaggc aggtccagga ggctgggtgc gcataatgga aggaagggga gcgctcctgt 2280 

ctgtgtgtgt gtcttgcatc tgtgcacatg ctgtgtgttt ctctgtacct gcattgcaca 2340 

tgtgtagtgt gtgcacgtgt cgtgtgtgaa tgtatgtgtg gtgtgtgtgc acaagtgtct 2400 

gtgtgtgtgc atgtgcaggt gccggcatgg gtgtagtgtt tgtgcacaca tgcacatgcg 2460 

tctcttcaca catggtgttg aggtcttgca tgggcgcacg tgtgcatgtg catcttctgc 2520 

ctgtcatcac tgtcaacagc tcacagcagc cagctggaca taaataaagg agttttgcag 2580 

gaatgtggct gacaggggaa attcctcccc accattccct gggggcatcc atggagcccc 2640 

cacgcactct ggctgtgggt aggatggcat gaagcacaaa gcttggtttc tgtcctgcag 2700 

aagatataga tgcttcacag agacagcaga gcagatgccc cagaggcact gtgcccaggg 276 0 

cggggaaggg tggggaggag agggcagcca ggggctctcc cctcaggaca ctgtgtgggt 2820 

gaggtgggca aagcttgaca acaggggtca cctcctttct tggagaaaag ccctaccctg 2880 

ttactacagg gagggcccgc atgggtgagg tggtgccaga cttgggtcgc caggtcccgg 2940 

gaatgacctc agttaccctg tcagcacctg tgggcagaag ctaccatctc atccctgctt 3000 

agacctgagt ggcctttgcc cagcacctgg aggccgctct gagaaaaggc tgcagctcga 3060 

acacaaacag gcagcttcta ccagggcccc cagtcagctc cctgcaggcc gattcccctt 3120 

ggggacaagg aggatgggat acgggtcagg gcctgtgtct tgctggggcg gcctcacaag 3180 

ctctgccccg gcctctgtag gaatgggcct gaatggcgct tcaaccgatt gcggctgaac 324 0 

ccagatgtgc tgtcgcccaa ggccgtgcag aggttcctcc cgatggtgga tgcagtggcc 3300 

agggacttct cccaggccct gaagaagaag gtgctgcaga acgcccgggg gagcctgacc 3360 

ctggacgtcc agcccagcat cttccactac accatagaag gtgtgggcca tgcgggaagg 3420 

tccagcccca gagaccctgg agtggccagg gatggggatg gaggactgaa gggagtgtgg 3480 

ggaggcagcc aggaggcctg gggctgcctt gtgctcagca gtgcatcctc cccgcagcca 354 0 

gcaacttagc tctttttgga gagcggctgg gcctggttgg ccacagcccc agttctgcca 3600 

gcctgaactt cctccatgcc ctggaggtca tgttcaaatc caccgtccag ctcatgttca 3660 

tgcccaggag cctgtctcgc tggatcagcc ccaaggtgtg gaaggagcac tttgaggcct 3720 

gggactgcat cttccagtac ggtgaggcca gggacccggg cagtgctatg gggaagggac 3780 

accatggggg cccaatttct ccttctccac cacccagtgg ggaatggagg ccacagggag 3840 

gggtcgggga ttcctcacct tcctgccggg gagattggtg cgaggctggg gctgggccgg 3900 

gctgatccgg agaatttggg atgagagcag ggagatttgg gtgtcggggc agtctgggca 3960 

ggaggaggac actgaaggat gcttcccagc accaagatct gagggctgtc ccctgctccc 4020 

tggacaggtg acaactgtat ccagaaaatc taccaggaac tggccttcaa ccgccctcaa 4 080 

cactacacag gcatcgtggc ggagctcctg ttgaaggcgg aactgtcact agaagccatc 4140 

aaggccaact ctatggaact cactgcaggg agcgtggaca cggtcaggcc agcaaccagc 4200 

cccacccaga gagggtgatg ccaagcctgc ctcccaggca ctgcctgcca atgccacacg 4260 

gcacccacgt tccccatccc caggctacag gccccacatt tctgttgccc tcagccttcc 4320 

ccctcctttg ttaagggatg agatttgcag gggaggggaa atgtgagctc cccctcacat 4380 

gagactgagt ttgcagttac ctgtgtgggg atccatgctc caggctggaa gaaagttgga 444 0 

tgaggccctg gacacacagc agctctgtcc ccactggaaa gctctgggtg tacaaggaga 4500 

aggagggttg agaggcagct ggaggactcc actgggcacc cttcccagtg tgcccggtca 4 560 

ccttgggcca gaaatgtaga tgcatgggag ggcagggttg tggggaagac agcagcacag 4 620 

gctccagcca gtgcagaggg gcctgtgggt gcacagtggg gagaactcaa tggaagcaga 4 6 80 

gggagccggg gctccagaac tccctggatg atgctgaggt gtggccccct gccctaatgg 4 74 0 

tggccgtgag aacccgccct gaagaggctg caggggacct gggccttggt ggagatgggg 4 800 

gtcacctttc cctgaagaag ccagggaatc tggcccaagt ggtcatcaag gtttcagatc 4860 

.cggggtccca gggctctgtt tttgctcagg gcatggatgt ctccacccct cagagggagg 4920 

ttgtcctggg aggggtgtcc cgggggctga gtcctcctgt gcaaggtctg accctgcaga 4980 

catggcttct gtagacagcg Cttcccttgc tgatgacgct ctttgagctg gctcggaacc 5040 

ccgacgtgca gcagaccctg cgccaggaga gcctggccgc cgcagccagc atcagtgaac 5100 

atccccagaa ggcaaccacc gagctgccct tgctgcgggc ggccctcaag gagaccttga 5160 

ggtgggtgct ggctgaggcc tccctgtggc cctggccccc tgctggagag cagcccccac 5220 

tgggtggtgg cagacagaat ctggggctga taaacagcgt cacccagcag cccattcccc 5280 
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tgcacctgct cttcctcccc ctcaaggaca gggagctctt cttcctctgg aatccctctt 5340 

caacgccctg gggattaacg tggggcatgt ccttctgcgc tcggggctgc ttaagttagg 54 00 

ggaggtttgg ctgggctcag caggtgcaag gaagcacttc ccacgacctg ggcttcccat 54 60 

ggatctggga cctctgcggg gtcttcggta ggaagggtgc tgagagcaca gggagcccca 5520 

tccagctgag gaccctttct gtggatgccc ccacctccag gctctaccct gtgggtctgt 5580 

ttttggagcg agtggtgagc tcagacttgg tgcttcagaa ctaccacatc ccagctgggg 5640 

tgagtgagcc ccacacccct cgagctgaga acctccctcc ccagtcattc cctgatccct 5700 

gctctgcacc gtccgcagac attggtacag gttttcctct actcgctggg tcgcaatgcc 5760 

gccttgttcc cgaggcctga gcggtataat ccccagcgct ggctagacat caggggctcc 5820 

ggcaggaact tccaccacgt gccctttggc tttggcatgc gccagtgcct cgggcggcgc 5880 

ctggcagagg cagagatgct gctgctgctg caccacgtaa gcaggcctgg gggcgggggc 5 940 

gggacctggg cagcagaggc gggacctgca cactgggggc ggggcttgca cggtgtgatt 6000 

gacacctggg aacagtggat ggggccttgg ttggttgagg tcggcgtgac cagggaggat 6060 

ctgtgctgag caagacaggg taggatctgg gtgaggttgc ttctaaacat tgaaatgggg 6120 

actaggggag tggggtggag cctgtacaga ataatggggc tcgggcaaga cctgggcagg 6180 

attcagtctg ggcctggtcc gcaaggtggg gctggtcaga aatgggatag gttggggccc 6240 

aggctgctgc tcccccttca gcataattgt tgcacctggg acgatgggag gaagctgccc 6300 

caggtccatg ggctactgac caggccagat ggaaacccag cctctgtcct aggtgctgaa 6360 

gcacttcctg gtggagacac taactcaaga ggacataaag atggtctaca gcttcatatt 6420 

gaggcctggc acgtcccccc tcctcacttt cagagcgatt aactagtctt gcatctgcac 6480 

ccagggtccc agcctggcca ccagcttccc tctgcctgac cccaggccac ctgtcttctc 6540 

tcccacgtgc acagcttcct gagtcacccc tctgtccagc cagctcctgc acaaatggaa 6600 

ctccccaggg cctccaggac tggggcttgc caggcttgtc aaatagcaag gccagggcac 6660 

agctggagac gatcttgctg gcagggcctg gccttgtccc cagccccacc tggccccttc 6720 

tccagcaagc agtgccctct ggacagcttg actctactcc tcccagcgct ggctccaggc 6780 

tcctcatgag gccatgcaag ggtgctgtga ttttgtccct tgccttcctg cctagtctca 6840 

catgtccctg tccctctcgc cctggccagg gcctctgtgc agacagtgtc agagtcatta 6 900 

agcgggatcc 6910 



<210> 135 

<211> 2476 

<212> DNA 

<213> Homo sapien 



<400 
gaattcttgt 
ttttttgctt 
tgtgttctaa 
catttcaata 
cttcacttcg 
aaaccatcag 
ggatttctgg 
gtttctagca 
cctctatggg 
aaagtttttg 
atgagtgttg 
tggcaagcat 
acattttatt 
gctttcccac 
cttggtttta 
ttactgaaga 
gcagctgctg 
ctggatgctc 



> 135 
tttacaagcc 
ttgacaaaca 
gagttctatg 
tgaagggcaa 
ggcttgtgaa 
ataagcattt 
tcaatattgt 
tatacatctt 
caacctatta 
gttcttttct 
ataggtacca 
cttatatagt 
ttcgagacgt 
ctgagaaata 
ttatcccttt 
cgaatagcta 
ttgttctggc 
tggcctggat 



atggctctgt 
ttcaaaatgc 
ttttttctcc 
ctccaccctt 
catctctggc 
agatgcaatt 
cgtggttaca 
caacctcgct 
ttcttataga 
taccctgaac 
atctgtcatc 
tccccttgtt 
cagaaccatt 
tgcccaatgg 
aatattcata 
tgggaagaac 
cttcatcatt 
gggtgtcatt 



ttcttaatgt 
taatgattca 
acagaaggca 
gccactacta 
aacaatgagt 
cctattcttt 
ctgttttgtt 
gtggctgatt 
catgactggc 
atgtttgcaa 
cacccccttc 
^99tgtatgg 
gaatacttag 
Lcagctggga 
gcaacatgct 
aggataaccc 
tgctggcttc 
aatagctgcg 



tttctataat 
aggatgtcct 
taagaactag 
gcaaaaacat 
ctaccttgaa 
actacattat 
gtcaaaaggg 
tactcctttt 
tctttggacc 
gcattttttt 
tgtctcaaag 
cctgtttgtc 
gagtgaatgc 
ttgccttaat 
attttggaat 
gtgaccaagt 
ccttccacgt 
aagttatagc 



cactcacttt 
cagctctgta 
gagctgctga 
taccagcggt 
ctgttcacag 
atttgtaatt 
tcctaaaaag 
ggctactctt 
tgtgatgtgc 
tatcacctgc 
aagaaatccc 
ctcattgcca 
ttgcattatg 
gaaaaatatc 
tagaaaacac 
cctgaagatg 
tctgaccttc 
agtcattgac 



60 
120 

iso 

240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
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ctggcacttc 
tattgttttg 
acttggctcc 
atggagacct 
ctttgaggct 
atcttttctg 
gaattgccca 
caacctagaa 
gatttgggga 
ccatttttat 
acaagacaac 
tactggatta 
cttgcatttc 
aggaatatag 
agatgtacta 
aagcactttt 
ctgatacttt 
tcacatcctt 
ttattaatgc 
tttaacttgt 
acatggggtc 
gaacctgtca 
ctatatctat 
caattgcagg 



cttttgccat 
ttggaaaccg 
aagggaaaag 
ttgtgtctta 
caccagaatt 
aatcttctga 
ttgtttttct 
gtaactggtg 
ttcagatttc 
caggatttcc 
attgtaagag 
ttcaggcttt 
acttgagtgg 
attaaatcac 
taacatagaa 
taaaaaactt 
tttgaaatca 
gactttttag 
tttggttctg 
aataaaccct 
tgggtggggg 
gggcagtaca 
aaacactgtt 
tctaga 



ccccttggga 
gttccaacag 
agagagtatg 
aacgtgagag 
atttttaagt 
aaccaaatgt 
gatatgtttg 
atatatctca 
tctttgaaac 
tcttgaacca 
agatgagcac 
aggcatatgc 
aggtttatag 
actcctatgc 
ttgcaatcta 
tctactcatt 
gtaaacactg 
atgtgctgct 
ggttgtttcc 
taactggcat 
caaagagacc 
atgtgacttt 
tgttccagaa 



ctcaccaaca 
aagctccgca 
tcttgccgga 
caaaatgcat 
ggctttaata 
aactatgttt 
tacaagattt 
aattgtaatt 
atgcttgtgt 
gaaccagtct 
ttctaagttg 
ttctttaaaa 
ttaatctata 
tttagcttat 
taatatttgt 
ttaatgattg 
tgtattgttg 
ttgatatata 
taaaatatct 
aggaaatggt 
cagtcaatta 
gaaaatatat 
tctgtatgat 



gctgcgttaa 
gtgtgtttag 
aaagcagttc 
gtaatcaaca 
aaataataaa 
tatcgtccag 
tcattggtga 
aataatagat 
ttcttagtgg 
ttcaactcat 
agtatattat 
acgctataaa 
actacatatt 
ttttacagtt 
gtgttcacta 
tttaaaggtt 
caaaatgtaa 
ggacattgat 

gggtggctta 

atccagaatg 
catgtttggt 
accgtggggg 
cctatggagc 



cccgtttctg 
ggttccaatt 
tcttagagaa 
tggctacttg 
atttccccta 
tgactttcag 
gacatattta 
tgtgaataat 
ggttttatat 
tgcatcattt 
aatagattag 
ctatattccc 
gaatagggct 
atagaaagca 
aactctgaat 
tctattttct 
aggtcacttt 
ttgattttta 
aaaaaaactc 
gaattttgct 
accaagaaag 
tagttttacc 
tattttaaac 



<210> 136 

<211> 3100 

<212> DNA 

<213> Homo sapien 



<400 
gaattcctga 
tataatttaa 
gaaaaaaagt 
tgtgtacggc 
gccttatcga 
ctctattccg 
ccctatgcct 
aacttcttcc 
ggaggccggg 
tcacctgagc 
aaaatacaaa 
tgaggcagga 
actgaactcc 
cgccaggagt 
ctttctaacg 
ctgataaatc 
atgatgctga 
ggtcaaaact 
acagctctgg 
accatcaacg 
gtgagcatta 
tcaaggactc 
gctgaggaca 
attcatggct 



> 136 

gatccctgta 
tgtattatac 
ttcaggaaag 
aaagttactg 
cacagaaaag 
gtaacaccac 
caccctgcag 
aatcccttgg 
gcagtggctc 
tcaggagttc 
attagccgtg 
gaatcacttg 
agcctgggca 
ctcaggcttt 
ctcttggttt 
atgggcaggg 
ggtttgggcc 
ggttgatttt 
aagtcggtct 
aacatgatgt 
tataacattt 
aattagagat 
ctacaggagc 
ctcttgtccc 



ctggtactcc 
agaaattcgg 
ttttccaaca 
gcaatctggc 
caatgccttc 
ttgacagaaa 
ctcagcactc 
ctattttgag 
atgcctctat 
gagacgagcc 
tgtggcggca 
aacccgggag 
acaaaagtga 
cactgacttg 
cctggccact 
ggtgaataaa 
taaagcaaat 
atataaactt 
ccccttgcaa 
tcgttacggg 
tcatgcaccc 
tttcctaatt 
caaattagag 
ctcaccccca 



cgctagaata 
agaaacttga 
ttccaatgtc 
ttctgccctg 
cacccttcgg 
gatggcaagg 
agaactcctt 
actgcccctt 
ccagcacttt 
taccaacatg 
cacgcctgta 
gccagaggtt 
aactctgtct 
gataacttgg 
cctcctaacc 
caaatcactc 
tcatatgttt 
gaaaaacatt 
ggtaaggata 
cgctgttcct 
tcacataaac 
tttttatttt 
aagggggctt 
gtataatcct 



accttctggt 
cgtatctctg 
acgtgtgttt 
aacctctacg 
gggcatttaa 
aaaagcctct 
tcctgggcag 
gcgggtccag 
gggaggccaa 
gtaaaacgcc 
atcccagcta 
gcagtgggcc 
caaaaaaata 
ggtgtagtct 
tccccaagtt 
cccagtttta 
gcatttatgt 
actttttgct 
tgagcttggt 
attccatgct 
aattagaagc 
tttttttatt 
gcagtcctgc 
ttttgccagg 



ttcgagggtt 
aatgagcaga 
atatatatac 
ccaagttctg 
ggttgacaca 
tccccacatt 
cgtcaatttc 
aaggaagcca 
ggtgggtgga 
atctccacta 
cttgacaggc 
aagatcacgc 
aaaagaaaga 
ccttcctccc 
ggaaaatcag 
catactacat 
aagagaaagt 
tccgacctgg 
catattgatg 
agaaggacca 
tgtttactcc 
tcaggcctga 
catattgcat 
tcaacaactc 



1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2476 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 



-28- 



wo 00/22166 



PCT/IB99/01678 



actaatttca aagcaattca tttgccaact caaaaaaaaa aaaaaaaaaa aaaacccaca 1500 

gttccaactt taaattgctt ttggatttta atcctctggg gtatttatat gtttccttgg 1560 

ctcatccctc caaaactaat ctcagagttt tgggaatctg ggaacttggg caaaggggga 1620 

aaccaggcac gaagaccaag gatctgaaat cgcagttcat tacgtcagca taaactgaca 1680 

gtaccttttt gtttgtattg acccagctca aattataaaa tcacatgagt aataaaacac 1740 

aaaatacagg tgttatcatt gaaaagtcta aaatgtaatt aaacgcgttt ttccccctcg 1800 

cggtggtacg tttactctgt atctcaacgt ttctgtctcc ctaagcctct cctttcatat 1860 

catgagcata catttttgca tcatgctcac acgttcatta gctaggactg aggagtgtgt 1920 

acgactccaa agggccctca gcgttagctg ttagatgcac aaaccttcgc ttcctttcca 1980 

catctactgc actcgaggtt caacagagga tcttgaagaa cagcgccaga gagcattctt 2040 

gacagatgcg cgcttggctc cagcaaccgc tctgctggga agtttcttct aaccactaac 2100 

accacctcca atcccccaag ctgtcacgac gcatgctggc tgggtcccgt cttgacgggg 2160 

gaagggtttt acacacaccc tgctaggctg ccccacatca caaccaagct cgcagggcaa 2220 

actcctccaa gcctggcgga caagctgtcc caggcgctct ggcgcttcct gaacaccaag 2280 

gtcccctccc ccgtcaaggg agctagcgct ctgttccgca gagaaccccg gaactgcagg 2340 

tcgaggggat gcggggggag cgggggcgca ggagggagcc gagtgctgga ggcaaacggg 2400 

gcgcaggagc gggtgcggga ggcaaacggg gcgcaggagt gggtgcggga ggcaaacggg 2460 

cgcaggagtg ggtgcgggag cgagtggggg ctcaggaggg gtcgggcccg ggggagccag 2520 

gcgcggaagg gggcgcgggg gaacagggac caggaaccag cgggcgcagg aaggggtgcg 2580 

tccgaaggaa cccgcgggcg cacgggaggc actagctacg cgatcagctc gggactctca 2640 

ggagccgctc aattgccaac gggagggggg tgcggggagt tggaggtggg ggagcagacc 2700 

agacgggggc gtgcctttgc ccgcattggc tgcaggagcc tgacgcgagg ccccgggggt 2760 

tggcttgggg agtgggagcg gggtggggtg ggtgctgggt gccggagctg cgggcccggc 2820 

gcgctcagaa acatgctgaa gccccggcgg ctcttccagc agcaggcagc ggctccagca 2880 

gcagcaggca ggcaggcagg caggcaggca ggcagcggca gcgacagcgc tcggctctgc 294 0 

gggaaaacgg gcccggcgcc catgctccgg ccccgcgggg cggctgccct gacccggccg 3 000 

cgacctccct ctgcgcacca cgccgcccgg gcttctgggg tgttccccaa ccacggccca 3060 

gccctgccac accccccgcc cccggcctcc gcagctcggc 3100 

<210> 137 
<211> 1723 
<212> DNA 
<213> Homo sapien 

<400> 137 

tgctacccgc gcccgggctt ctggggtgtt ccccaaccac ggcccagccc tgccacaccc 60 

cccgcccccg gcctccgcag ctcggcatgg gcgcgggggt gctcgtcctg ggcgcctccg 120 

agcccggtaa cctgtcgtcg gccgcaccgc tccccgacgg cgcggccacc gcggcgcggc 180 

tgctggtgcc cgcgtcgccg cccgcctcgt tgctgcctcc cgccagcgaa agccccgagc 24 0 

cgctgtctca gcagtggaca gcgggcatgg gtctgctgat ggcgctcatc gtgctgctca 300 

tcgtggcggg caatgtgctg gtgatcgtgg ccatcgccaa gacgccgcgg ctgcagacgc 360 

tcaccaacct cttcatcatg tccctggcca gcgccgacct ggtcatgggg ctgctggtgg 420 

tgccgttcgg ggccaccatc gtggtgtggg gccgctggga gtacggctcc ttcttctgcg 4 80 

agctgtggac ctcagtggac gtgctgtgcg tgacggccag catcgagacc ctgtgtgtca 54 0 

ttgccctgga ccgctacctc gccatcacct cgcccttccg ctaccagagc ctgctgacgc 600 

gcgcgcgggc gcggggcctc gcgtgcaccg tgtgggccat ctcggccctg gtgtccttcc 660 

tgcccatcct catgcactgg cggcgggcgg agagcgacga ggcgcgccgc tgctacaacg 720 

accccaagtg ctgcgacttc gccaccaacc gggcctacgc caccgcctcg tccgtagtct 780 

cctcctacgt gcccctgtgc accatggcct tcgtgtacct gcgggtgttc cgcgaggccc 84 0 

agaagcaggt gaagaagatc gacagctgcg agcgccgttt cctcggcggc ccagcgcggc 900 

cgccctcgcc ctcgccctcg cccgcccccg cgcccgcgcc gccgcccgga cccccgcgcc 960 

ccgccgccgc cgccgccacc gccccgctgg ccaacgggcg tgcgggcaag cggcggccct 102 0 

cgcgcctcgt ggccctacgc gagcagaagg cgctcaagac gctgggcatc atcatgggcg 108 0 

tcttcacgct ctgctggctg cccttcttcc cggccaacgc ggtgaaggcc ttccaccgcg 114 0 

agccggtgcc cgaccgcctc ttcgtcttct tcaactggcc gggctacgcc aactcggcct 1200 
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tcaaccccat 
gctgcgcgcg 

cgggctgtct 

acgacgatgt 
acggcggggc 
cctcggaatc 
gggaacgagg 
cctcgtctga 
tttgggaagg 



catctactgc 
cagggctgcc 
ggcccggccc 
cgtcggggcc 
ggcggcggac 
caaggtgtag 
agatctgtgt 
atcatccgag 
gatgggagag 



cgcagccccg 
cgccggcgcc 
ggacccccgc 
acgccgcccg 
agcgactcga 
ggcccggcgc 
ttacttaaga 
gcaaagagaa 
tggcttgctg 



acttccgcaa 
acgcgaccca 
catcgcccgg 
cgcgcctgct 
gcctggacga 

ggggcgcgga 

ccgatagcag 
aagccacgga 
atgttccttg 



ggccttccag 
cggagsiccgg 
ggccgcctcg 
ggagccctgg 
gccgtgccgc 
ctccgggcac 
gtgaactcga 
ccgttgcaca 
ttg 



ggactgctct 
ccgcgcgcct 
gacgacgacg 
gccggctgca 
cccggcttcg 
ggcttcccag 
agcccacaat 
aaaaggaaag 



<210> 138 

<211> 3451 

c212> DNA 

<213> Homo sapien 



<400> 138 



cccgggttca 
ccaccacacc 
aggatctttt 
tgaaccatat 
tggcacaacc 
ttgggtttac 
gtgagtgatg 
cacacaactt 
tccaaagggt 
gacagcgagt 
atgtttgaaa 
gctcgggtga 
atgcctccaa 
ggtgcttctg 
gatggcttct 
tcagtgtcta 
cctgtatgtg 
tgtgctgccc 
gccacagaag 
cctctggggc 
gggccccgcc 
gagtgcctcg 
ggcgagggca 
cttcttcaga 
gagtgtgcag 
cgtccgctcg 
gtgcgcttac 
ctggcaccca 
tgggtggtgg 
gtgctggtca 
atcacttcac 
gcccatattc 
attgatgtgc 
tactttgcca 
gtgatcattc 
cactggtacc 
gacttcttca 
ctggtgatca 
aagattgaca 



agagattctc 
tggctaattt 
gctttctata 
gaatttgcca 
cgaaagatta 
tgaagaaatt 
ccacactctc 
tctctctctg 
aaaactgtct 
gtgctgagga 
gggaatacac 
ggcaagttcg 
gccagcgtgt 
tgtttgtttc 
gtgtgagagc 
tggctgtggt 
cgtgcctcgg 
tctgccttga 
agccaaaagc 
cagccagggt 
cgggccagcc 
ccccttcgcg 
gttcccctaa 
gcacgggctg 
gacgagtccc 
cggcccgcag 
ctgccagact 
atagaagcca 
gcatgggcat 
tcacagccat 
tggcctgtgc 
ttatgaaaat 
tgtgcgtcac 
ttacttcacc 
tgatggtgtg 
gggccaccca 
cgaaccaagc 
tggtcttcgt 
aatctgaggg 



ctgtctcagc 
ttgtattttt 
gcttcaaaat 
ttttggtaag 
acaaactatc 
gtttgaattc 
aagagttaaa 
tcccaaaata 
tcatgcctgc 
aatcggcagc 
tgggttaccg 
gagtacccag 
gtttactttc 
tggccgcgtt 
gcgcgcgagt 
tcggtataag 
tgggcactct 
gacctcaagc 
tcccgggttg 
agccgggaag 
ccaggagaag 
gctgccggcg 
agtcctgtgc 
gaactggcag 
caccacaccc 
agccccgccg 
gcgcgccatg 
tgcgccggac 
cgtcatgtct 
tgccaagttc 
tgatctggtc 
gtggactttt 
ggccagcatt 
tttcaagtac 
gattgtgtca 
ccaggaagcc 
ctatgccatt 
ctactccagg 
ccgcttccat 



ctcccgagta 
agtagagaca 
gttcttaatg 
tcacagacgc 
cagcagatga 
tcattgcatc 
aacaaaacaa 
catacttgca 
aaattcctaa 
tgttgaagtc 
ggtgtatgtt 
atggagacat 
tgtgtgtgtc 
tctgtgttgg 
gtgcatgtcg 
tctgagcatg 
cgtttccttc 
cgcgcaggcg 
gctggtaagg 
cagtggtggc 
gagggcgagg 
tgccattggc 
acataacggg 
gcaccgcgag 
acaccacagc 
tgggtccgcc 
gggcaacccg 
cacgacgtca 
ctcatcgtcc 
gagcgtctgc 
atgggcctgg 
ggcaacttct 
gagaccctgt 
cagagcctgc 
ggccttacct 
atcaactgct 
gcctcttcca 
gtctttcagg 
gtccagaacc 



gctgggacta 
agagttacac 
ttaagacatt 
cagatggtgg 
aaggattttt 
tccagttcaa 
caaaaaaatt 
tacccccgct 
ggagggcacc 
acctcctgtg 
gggaggggag 
ccgtgtctgt 
accatgtctt 
acaggggtga 
gtgagctggg 
tctgccaggg 
cgaatgtggg 
cccagggcag 
acaccacctc 
ccgccctcca 
ggaggggagg 
cgaaagttcc 
cagaacgcac 
cccctagcac 
cgctgaatga 
cgctgaggcg 
ggaacggcag 
cgcagcaaag 
tggccatcgt 
agacggtcac 
cagtggtgcc 
ggtgcgagtt 
gcgtgatcgc 
tgaccaagaa 
ccttcttgcc 
atgccaatga 
tcgtgtcctt 
aggccaaaag 
ttagccaggt 



caggtacgtg 
catattggcc 
cttaatactc 
caatttcaca 
tttagtttca 
cagataatga 
aaaacaaaag 
ccagataaaa 
taaagtactt 
ctcttgccaa 
cattatcagt 
gtcgctctgg 
tgtgcttctg 
ctttgtgccg 
agggtgtgtc 
tgtatttgtg 
gcagtgccgg 
gcaggtagcg 
cagctttagc 
gggagcagtt 
gaaaggggag 
cgtacgtcac 
tgcgaagcgg 
ccgacaagct 
ggcttccagg 
cccccagcca 
cgccttcttg 
ggacgaggtg 
gtttggcaat 
caactacttc 
ctttggggcc 
ttggacttcc 
agtggatcgc 
taaggcccgg 
cattcagatg 
gacctgctgt 
ctacgttccc 
gcagctccag 
ggagcaggat 



1260 
1320 
1380 
1440 
1500 
1560 
1620 
1660 
1723 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
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gggcggacgg ggcatggact ccgcagatct tccaagttct gcttgaagga gcacaaagcc 24 00 

ctcaagacgt taggcatcat catgggcact ttcaccctct gctggctgcc cttcttcatc 2460 

gttaacattg tgcatgtgat ccaggataac ctcatccgta aggaagttta catcctccta 2520 

aattggatag gctatgtcaa ttctggtttc aatcccctta tctactgccg gagcccagat 2580 

ttcaggattg ccttccagga gcttctgtgc ctgcgcaggc cttctttgaa ggcctatggg 2640 

aatggctact ccagcaacgg caacacaggg gagcagagtg gatatcacgt ggaacaggag 2700 

aaagaaaata aactgctgtg tgaagacctc ccaggcacgg aagactttgt gggccatcaa 2760 

ggtactgtgc ctagcgataa cattgattca caagggagga attgtagtac aaatgactca 2820 

ctgctgtaaa gcagtttttc tacttttaaa gacccccccc cccccaacag aacactaaac 2880 

agactattta acttgagggt aataaactta gaataaaatt gtaaaaattg tatagagata 2940 

tgcagaagga agggcatcct tctgcctttt ttattttttt aagctgtaaa aagagagaaa 3000 

acttatttga gtgattattt gctatttgta cagttcagtt cctctttgca tggaatttgt 3060 

aagtttatgt ctaaagagct ttagtcctag aggacctgag tctgctatat tttcatgact 3120 

tttccatgta tctacctcac tattcaagta ttaggggtaa tatattgctg ctggtaattt 3180 

gtatctgaag gagattttcc ttcctacacc cttggacttg aggattttga gtatctcgga 3240 

cctttcagct gtgaacatgg actcttcccc cactcctctt atttgctcac acggggtatt 3300 

ttaggcaggg atttgaggag cagcttcagt tgttttcccg agcaaaggtc taaagtttac 3360 

agtaaataaa atgtttgacc atgccttcat tgcacctgtt tgtccaaaac cccctgactg 3420 

gagtgctgtt gcctccccca ctggaaaccg c 3451 

<210> 139 
<211> 2305 
<212> DNA 
<213> Homo sapien 

<400> 139 

gaattcatgc cgcgtttctg tgttggacag gggtgacttt gtgccggatg gcttctgtgt 60 

gagagcgcgc gcgagtgtgc atgtcggtga gctgggaggg tgtgtctcag tgtctatggc 120 

tgtggttcgg tataagtcta agcatgtctg ccagggtgta tttgtgcctg tatgtgcgtg 180 

cctcggtggg cactctcgtt tccttccgaa tgtggggcag tgccggtgtg ctgccctctg 24 0 

ccttgagacc tcaagccgcg caggcgccca gggcaggcag gtagcggcca cagaagagcc 300 

aaaagctccc gggttggctg gtaagcacac cacctccagc tttagccctc tggggccagc 360 

cagggtagcc gggaagcagt ggtggcccgc cctccaggga gcagttgggc cccgcccggg 420 

ccagcctcag gagaaggagg gcgaggggag gggagggaaa ggggaggagt gcctcgcccc 480 

ttcgcggctg ccggcgtgcc attggccgaa agttcccgta cgtcacggcg agggcagttc 540 

ccctaaagtc ctgtgcacat aacgggcaga acgcactgcg aagcggcttc ttcagagcac 600 

gggctggaac tggcaggcac cgcgagcccc tagcacccga caagctgagt gtgcaggacg 660 

agtccccacc acacccacac cacagccgct gaatgaggct tccaggcgtc cgctcgcggc 720 

ccgcagagcc ccgccgtggg tccgcctgct gaggcgcccc cagccagtgc gcttacctgc 780 

cagactgcgc gccatggggc aacccgggaa cggcagcgcc ttcttgctgg cacccaatag 840 

aagccatgcg ccggaccacg acgtcacgca gcaaagggac gaggtgtggg tggtgggcat 900 

gggcatcgtc atgtctctca tcgtcctggc catcgtgttt ggcaatgtgc tggtcatcac 960 

agccattgcc aagttcgagc gtctgcagac ggtcaccaac tacttcatca cttcactggc 1020 

ctgtgctgat ctggtcatgg gcctggcagt ggtgcccttt ggggccgccc atattcttat 1080 

gaaaatgtgg acttttggca acttctggtg cgagttttgg acttccattg atgtgctgtg 1140 

cgtcacggcc agcattgaga ccctgtgcgt gatcgcagtg gatcgctact ttgccattac 1200 

ttcacctttc aagtaccaga gcctgctgac caagaataag gcccgggtga tcattctgat 1260 

ggtgtggatt gtgtcaggcc ttacctcctt cctgcccatt cagatgcact ggtaccgggc 1320 

cacccaccag gaagccatca actgctatgc caatgagacc tgctgtgact tcttcacgaa 1380 

ccaagcctat gccattgcct cttccatcgt gcccttctac gttcccctgg tgatcatggt 1440 

cttcgtctac tccagggtct ttcaggaggc caaaaggcag ctccagaaga ttgacaaatc 1500 

tgagggccgc ttccatgtcc agaaccttag ccaggtggag caggatgggc ggacggggca 1560 

tggactccgc agatcttcca agttctgctt gaaggagcac aaagccccca agacgttagg 1620 

catcatcatg ggcactttca ccctctgctg gccgcccttc ttcatcgtta acattgtgca 1680 

tgtgatccag gataacctca tccgtaagga agtttacatc ctcctaaatt ggataggcta 1740 
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tgtcaattct 
ccaggagctt 
caacggcaac 
gctgtgtgaa 
cgataacatt 
tttttctact 
gggtaataaa 
ccttctgcct 
tttgttattt 
gctttagtcc 



ggtttcaatc 
ctgtgcctgc 
acaggggagc 
gacctcccag 
gattcacaag 
tttaaagacc 
cttagaataa 
tttttatttt 
gtacagttca 
tagaggaccc 



cccttatcta 
gcaggtcttc 
agagtggaCa 
gcacggaaga 
ggaggaattg 
ccccccccca 
aattgtaaaa 
tttaagctgt 
gttcctcttt 
gagtc 



ctgccggagc 
tttgaaggcc 
tcacgtggaa 
ctttgtgggc 
tagtacaaat 
acagaacact 
tcgtatagag 
aaaaagagag 
gcatggaatt 



ccagatttca 
tatgggaatg 
caggagaaag 
catcaaggta 
gactcactgc 
aaacagacta 
atatgcagaa 
aaaacttatt 
tgtaagttta 



<210> 140 

<211> 4105 

<212> DNA 

<213> Homo sapien 



ggattgcctt 
gctactccag 
aaaataaacc 
ctgtgcctag 
tgtaaagcag 
tttaacttga 
ggaagggcat 
tgagtgatta 
tgtctaaaga 



1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2305 



<400> 140 

gaattcgcgg ccgcctcttg cggtcccaga gtggagtgga aggtctggag ctttgggagg 60 

agacggggag gacagactgg aggcgtgttc ctccggagtt ttctttttcg tgcgagccct 120 

cgcgcgcgcg tacagtcatc ccgctggtct gacgattgtg gagaggcggt ggagaggctt ' 180 

catccatccc acccggtcgt cgccggggat tggggtccca gcgacacctc cccgggagaa 24 0 

gcagtgccca ggaagttttc tgaagccggg gaagctgtgc agccgaagcc gccgccgcgc 300 

cggagcccgg gacaccggcc accctccgcg ccacccaccc ccgctttctc cggcttcctc 360 

tggcccaggc gccgcgcgga cccggcagct gtctgcgcac gccgagctcc acggtgaaaa 420 

aaaaagtgaa ggtgtaaaag cagcacaagt gcaataagag atatttcctc aaatttgcct 4 80 

caagatggaa accctttgcc tcagggcatc cttttggctg gcactggttg gatgtgtaat 540 

cagtgataat cctgagagat acagcacaaa tctaagcaat catgtggatg atttcaccac 600 

ttttcgtggc acagagctca gcttcctggt taccactcat caacccacta atttggtcct 660 

acccagcaat ggctcaatgc acaactattg cccacagcag actaaaatta cttcagcttt 720 

caaatacatt aacactgtga tatcttgtac tattttcatc gtgggaatgg tggggaatgc 780 

aactctgctc aggatcattt accagaacaa atgtatgagg aatggcccca acgcgctgat 84 0 

agccagtctt gcccttggag accttatcta tgtggtcatt gatctcccta tcaatgtatt 900 

taagctgctg gctgggcgct ggccttttga tcacaatgac tttggcgtat ttctttgcaa 960 

gctgttcccc tttttgcaga agtcctcggt ggggatcacc gtcctcaacc tctgcgctct 1020 

tagtgttgac aggtacagag cagttgcctc ctggagtcgt gttcagggaa ttgggattcc 1080 

tttggtaact gccattgaaa ttgtctccat ctggatcctg tcctttatcc tggccattcc 1140 

tgaagcgatt ggcttcgtca tggtaccctt tgaatatagg ggtgaacagc ataaaacctg 1200 

tatgctcaat gccacatcaa aattcatgga gttctaccaa gatgtaaagg actggtggct 1260 

cttcgggttc tatttctgta tgcccttggt gtgcactgcg atcttctaca ccctcatgac 1320 

ttgtgagatg ttgaacagaa nnaatggcag ctcgagaatt gccctcagtg aacatcttaa 13 80 

gcagcgtcga gaagtggcaa aaacagtttt ctgcttggtt gtaatttttg ctctttgctg 1440 

gttccctctt cacttaagcc gtatattgaa gaaaactgtg cataacgaaa tggacaagaa 1500 

ccgatgtgaa ttacttagtt tcttactgct catggattac atcggtatta acttggcaac 1560 

catgaattca tgtataaacc ccatagctct gtattttgtg agcaagaaat ttaaaaattg 1620 

tttccagtca tgcctctgct gctgctgtta ccagtccaaa agtctgatga cctcggtccc 1680 

catgaacgga acaagcatcc agtggaagaa ccacgatcaa aacaaccaca acacagaccg 174 0 

gagcagccat aaggacagca tgaactgacc acccttagaa gcactcctcg gtactcccat 1800 

aatcctctcg gagaaaaaaa tcacaaggca actgtgaccc cgggaatctc ttctctgatc 1860 

cttcttcctt aattcactcc cacacccaag aagaaatgct ttccaaaacc gcaaggtaga 1920 

ctggtttatc cacccacaac atctacgaat cgtacttctc taattgatct aatttacata 1980 

ttctgcgtgt tgtattcagc actaaaaaat ggtgggagcc gggggagaat gaagactgtt 2040 

aaatgaaacc agaaggatat ttactacttt tgcatgaaaa tagagctttc aagtacatgg 2100 

ctagctttta tggcagttct ggtgaatgtt caatgggaac tggtcaccat gaaactttag 2160 

agattaacga caagattttc tacttttttt aagtgatttc ttgtccttca gccaaacaca 2220 

atatgggctc aggtcacttt tatttgaaat gtcatttggc gccagtattt tttaactgca 2280 
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taatagccta acatgattat ctgaacttat ttacacatag tttgaaaaaa aaaagacaaa 2340 

aatagtattc aggtgagcaa ctagattagt attttccacg tcactattta tttttttaaa 2400 

acacaaattc taaagctaca acaaatacta caggccctta aagcacagtc tgatgacaca 2460 

tttggcagtt taatagatgt tactcaaaga attttttaag aactgtattt tattttttaa 2520 

atggtgtttt attacaaggg accttgaaca tgttttgtat gttaaattca aaagtaatgc 2580 

ttcaatcaga tagttctttt tcacaagttc aatactgttt ttcatgtaaa ttttgtatga 2640 

aaaatcaatg tcaagtacca aaatgttaat gtatgtgtca tttaactctg cctgagactt 2700 

tcagtgcact gtatatagaa gtctaaaaca cacctaagag aaaaagatcg aatttttcag 2760 

atgattcgga aattttcatt caggtatttg taatagtgac atatatatgt atatacatat 2820 

cacctcctat tctcttaatt tttgttaaaa tgttaactgg cagtaagtct tttttgatca 2880 

ttcccttttc catataggaa acataatttt gaagtggcca gatgagttta tcatgtcagt 2940 

gaaaaataat tacccacaaa tgccaccagt aacttaacga ttcttcactc cttggggttt 3000 

tcagtatgaa cctaactccc caccccaaca tctccctccc acattgtcac catttcaaag 3060 

ggcccacagt gacttttgct gggcattttc ccagatgttt acagactgtg agtacagcag 3120 

aaaatctttt actagtgtgt gtgtgtatat atataaacaa ttgtaaattt cttttagccc 3180 

atttttctag actgtctctg tggaatatat ttgtgtgtgt gatatatgca tgtgtgtgat 3240 

ggtatgtatg gatttaatct aatctaataa ttgtgccccg cagttgtgcc aaagtgcata 3300 

gtctgagcta aaatctaggt gattgttcat catgacaacc tgcctcagtc cattttaacc 3360 

tgtagcaacc ttctgcattc ataaatcttg taatcatgtt accattacaa atgggatata 3420 

agaggcagcg tgaaagcaga tgagctgtgg actagcaata tagggttttg tttggttggt 3480 

tggtttgata aagcagtatt tggggtcata ttgtttcctg tgctggagca aaagtcatta 3540 

cactttgaag tattatattg ttcttatcct caattcaatg tggtgatgaa attgccaggt 3600 

tgtctgatat ttctttcaga cttcgccaga cagattgctg ataataaatt aggtaagata 3660 

atttgttggg ccatatttta ggacaggtaa aataacatca ggttccagtt gcttgaattg 3720 

caaggctaag aagtactgcc cttttgtgtg ttagcagtca aatctattat tccactggcg 3780 

catcatatgc agtgatatat gcctataata taagccatag gttcacacca ttttgtttag 3840 

acaattgtct ttttttcaag atgctttgtt tctttcatat gaaaaaaatg cattttataa 3900 

attcagaaag tcatagattt ctgaaggcgt caacgtgcat tttatttatg gactggtaag 3 960 

taactgtggt ttactagcag gaatatttcc aatttctacc tttactacat cttttcaaca 4020 

agtaactttg tagaaatgag ccagaagcca aggccctgag ttggcagtgg cccataagtg 4080 

taaaataaaa gtttacagaa acctt 4105 
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